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Chapter 1 


LINEAR EQUATIONS 


1.1 Introduction to linear equations 

A linear equation in n unknowns xi, X2, • • ■ , x n is an equation of the form 


a\X\ + a 2 x 2 H V a n x n = b, 


where ai, a 2 , . . . , a n , b are given real numbers. 

For example, with x and y instead of x\ and x 2 , the linear equation 
2 x + 3 y = 6 describes the line passing through the points ( 3 , 0 ) and ( 0 , 2 ). 

Similarly, with x, y and z instead of x\, x 2 and X3, the linear equa- 
tion 2x + 3 y + 4 2 = 12 describes the plane passing through the points 
(6, 0 , 0 ), ( 0 , 4 , 0 ), ( 0 , 0 , 3 ). 

A system of m linear equations in n unknowns xi, x 2 , • ■ ■ , x n is a family 
of linear equations 


011X1 “t - 012X2 ~t“ * * * 4 " a\ n x n — b 1 
Cl2lXl + U22X2 + ' ' ' + 02 71X71 = 62 

Omixi A a m 2X2 + • • • + a mn x n = b m . 

We wish to determine if such a system has a solution, that is to find 
out if there exist numbers xi, x 2 , • • • , x n which satisfy each of the equations 
simultaneously. We say that the system is consistent if it has a solution. 
Otherwise the system is called inconsistent. 
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Note that the above system can be written concisely as 

n 

^ ~2 a ijXj = bi , % = 1 , 2 , • • • ,m. 

3 = 1 

The matrix 


an 

a 12 • ' 

0 1 n 

a 21 

022 ■ ' 

' ' 02 n 

Aral 

Om2 

' ' Omn 


is called the coefficient matrix of the system, while the matrix 



Oil 

012 ■ 

Oln 

bi 


021 

022 ■ 

■ ■ 02 n 

b 2 


Oml 

Om2 

' ‘ o mn 

bm 

is called the augmented 

matrix 

of the : 

system. 



Geometrically, solving a system of linear equations in two (or three) 
unknowns is equivalent to determining whether or not a family of lines (or 
planes) has a common point of intersection. 

EXAMPLE 1.1.1 Solve the equation 

2x + 2>y = 6 . 

Solution. The equation 2x + 3y = 6 is equivalent to 2x = 6 — 3 y or 
x = 3 — | y, where y is arbitrary. So there are infinitely many solutions. 

EXAMPLE 1.1.2 Solve the system 

x + y + z = 1 

x — y + z = 0 . 

Solution. We subtract the second equation from the first, to get 2y = 1 
and y = 7). Then x = y — z = \ — z, where z is arbitrary. Again there are 
infinitely many solutions. 

EXAMPLE 1.1.3 Find a polynomial of the form y = ao+aix+a 2 X 2 +azx 3 
which passes through the points (—3, —2), (—1, 2), (1, 5), (2, 1). 
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Solution. When x has the values —3, —1, 1, 2, then y takes corresponding 
values —2, 2, 5, 1 and we get four equations in the unknowns ao, «i, a 2 , 03: 


ao — 3ai + 9a 2 — 27a3 = —2 

do — a i + a 2 — 03 = 2 

T ®1 + a 2 + CI3 = 5 
ag + 2ai + 4a 2 + 803 = 1. 


This system has the unique solution ao = 93/20, ai = 221/120, a 2 = 
-23/20, 

03 = —41/120. So the required polynomial is 

93 221 23 2 41 , 

v = 20 + m x -20 x -uo x ■ 


In [26, pages 33-35] there are examples of systems of linear equations 
which arise from simple electrical networks using Kirchhoff’s laws for elec- 
trical circuits. 

Solving a system consisting of a single linear equation is easy. However if 
we are dealing with two or more equations, it is desirable to have a systematic 
method of determining if the system is consistent and to find all solutions. 

Instead of restricting ourselves to linear equations with rational or real 
coefficients, our theory goes over to the more general case where the coef- 
ficients belong to an arbitrary field. A field F is a set F which possesses 
operations of addition and multiplication which satisfy the familiar rules of 
rational arithmetic. There are ten basic properties that a field must have: 

THE FIELD AXIOMS. 

1. (a + b) + c = a + (b + c) for all a, b , c in F; 

2. ( ab)c = a(bc) for all a, b, c in F\ 

3. a + b = b + a for all a, b in F\ 

4. ab = ba for all a, b in F\ 

5. there exists an element 0 in F such that 0 + a = a for all a in F; 

6. there exists an element 1 in F such that la = a for all a in F; 
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7 . 


to every a in F, there corresponds an additive inverse —a in F. satis- 
fying 

a + (-a) = 0; 


8. to every non-zero a in F, there corresponds a multiplicative inverse 
a^ 1 in F. satisfying 


aa 1 = 1; 


9. a(b + c) = ab + ac for all a, 6, c in F; 

10. 0/1. 


With standard definitions such as a — b 
6/0, we have the following familiar rules: 


a + (—6) and 


a 

b 


ab 1 for 


-{a + 6) 
-(-a) 

-(a - 6) 
a c 


(-a) + (-6), (ab) l =a l b 
a, (a -1 ) -1 = a; 


,a i 6 


b~a, (-) = 

b a 

ad + be 


b + d 

bd ’ 

a c 

ac 

bd 

bd' 1 

ab 

b a ac 

ac 

(!) =y; 

-(ab) = 

(— a)6 = a(—b) 

f a \ ... 

—a a 

Kb) 

1 

I 1 

1 

1 

0a = 

0; 

(- a )- 1 = 

-(a’ 1 ). 


Fields which have only finitely many elements are of great interest in 
many parts of mathematics and its applications, for example to coding the- 
ory. It is easy to construct fields containing exactly p elements, where p is 
a prime number. First we must explain the idea of modular addition and 
modular multiplication. If a is an integer, we define a (mod p) to be the 
least remainder on dividing a by p: That is, if a = bp + r, where 6 and r are 
integers and 0 < r < p, then a (mod p) = r. 

For example, —1 (mod 2) = 1, 3 (mod 3) = 0, 5 (mod 3) = 2. 
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Then addition and multiplication mod p are defined by 

a® b = (a + b) (mod p) 
a<g>b = (ab) (mod p). 

For example, with p = 7 , we have 3 © 4 = 7 (mod 7 ) = 0 and 3 © 5 = 
15 (mod 7 ) = 1 . Here are the complete addition and multiplication tables 
mod 7 : 


© 

0 

1 

2 

3 

4 

5 

6 

© 

0 

1 

2 

3 

4 

5 

6 

0 

0 

1 

2 

3 

4 

5 

6 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

2 

3 

4 

5 

6 

0 

1 

0 

1 

2 

3 

4 

5 

6 

2 

2 

3 

4 

5 

6 

0 

1 

2 

0 

2 

4 

6 

1 

3 

5 

3 

3 

4 

5 

6 

0 

1 

2 

3 

0 

3 

6 

2 

5 

1 

4 

4 

4 

5 

6 

0 

1 

2 

3 

4 

0 

4 

1 

5 

2 

6 

3 

5 

5 

6 

0 

1 

2 

3 

4 

5 

0 

5 

3 

1 

6 

4 

2 

6 

6 

0 

1 

2 

3 

4 

5 

6 

0 

6 

5 

4 

3 

2 

1 


If we now let Z p = { 0 , 1 , ... ,p — 1 }, then it can be proved that Z p forms 
a field under the operations of modular addition and multiplication mod p. 
For example, the additive inverse of 3 in Z7 is 4 , so we write —3 = 4 when 
calculating in Z7. Also the multiplicative inverse of 3 in Z7 is 5 , so we write 
3 _1 = 5 when calculating in Z7. 

In practice, we write a© b and a© b as a + b and ab or ax b when dealing 
with linear equations over Z p . 

The simplest field is Z2, which consists of two elements 0 , 1 with addition 
satisfying 1 + 1 = 0. So in Z2, — 1 = 1 and the arithmetic involved in solving 
equations over Z2 is very simple. 

EXAMPLE 1.1.4 Solve the following system over Z2: 

x + y + z = 0 

X + z = 1 . 

Solution. We add the first equation to the second to get y = 1 . Then x = 
1 — z = 1 + z, with z arbitrary. Hence the solutions are (x, y, z) = ( 1 , 1 , 0 ) 
and (0, 1, 1). 

We use Q and M to denote the fields of rational and real numbers, re- 
spectively. Unless otherwise stated, the field used will be Q. 
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1.2 Solving linear equations 

We show how to solve any system of linear equations over an arbitrary field, 
using the GAUSS-JORDAN algorithm. We first need to define some terms. 

DEFINITION 1.2.1 (Row— echelon form) A matrix is in row-echelon 
form if 

(i) all zero rows (if any) are at the bottom of the matrix and 

(ii) if two successive rows are non-zero, the second row starts with more 
zeros than the first (moving from left to right). 

For example, the matrix 

' 0100 ' 

0 0 10 

0 0 0 0 

_ 0 0 0 0 _ 

is in row-echelon form, whereas the matrix 

' 0100 ' 

0 10 0 

0 0 0 0 

_ 0 0 0 0 _ 

is not in row-echelon form. 

The zero matrix of any size is always in row-echelon form. 


DEFINITION 1.2.2 (Reduced row echelon form) A matrix is in re- 
duced row-echelon form if 

1. it is in row-echelon form, 

2. the leading (leftmost non-zero) entry in each non-zero row is 1, 

3. all other elements of the column in which the leading entry 1 occurs 
are zeros. 


For example the matrices 


1 0 
0 1 


0 1 2 0 0 2 

0 0 0 1 0 3 

0 0 0 0 1 4 

0 0 0 0 0 0 


and 
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are in reduced row-echelon form, whereas the matrices 

'10 0 1 [ 1 2 0 ' 

010 and 010 

_ 0 0 2 J [ 0 0 0 _ 

are not in reduced row-echelon form, but are in row-echelon form. 

The zero matrix of any size is always in reduced row-echelon form. 

Notation. If a matrix is in reduced row-echelon form, it is useful to denote 
the column numbers in which the leading entries 1 occur, by c i, C 2 , . . . , c r , 
with the remaining column numbers being denoted by c r + 1 , . . . , c n , where 
r is the number of non-zero rows. For example, in the 4x6 matrix above, 
we have r = 3, c\ = 2, C 2 = 4, C 3 = 5, C 4 = 1, C 5 = 3, cq = 6 . 

The following operations are the ones used on systems of linear equations 
and do not change the solutions. 

DEFINITION 1.2.3 (Elementary row operations) There are three 
types of elementary row operations that can be performed on matrices: 

1. Interchanging two rows: 

Ri <-> Rj interchanges rows i and j. 

2. Multiplying a row by a non-zero scalar: 

Ri — > tRj multiplies row i by the non-zero scalar t. 

3. Adding a multiple of one row to another row: 

Rj — > Rj + t.Ri adds t times row i to row j. 

DEFINITION 1.2.4 [Row equivalence] Matrix A is row-equivalent to ma- 
trix B if B is obtained from A by a sequence of elementary row operations. 

EXAMPLE 1.2.1 Working from left to right, 

' 1 2 0 1 [ 1 2 0 ' 

A = 2 11 R'2 — * R 2 T 2I ? 3 4-15 

_ i 2 J |_ 1 -1 2 _ 

' 1 2 0 1 [ 2 4 0 ' 

R 2 ^ R 3 1-12 -> 2i?i 1-12 = B. 

4 -1 5 J [4-15 
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Thus A is row-equivalent to B. Clearly B is also row-equivalent to A, by 
performing the inverse row-operations R\ — > \Ri, R 2 R 3 , R 2 — * ► R 2 — 2i?3 
on B. 

It is not difficult to prove that if A and B are row-equivalent augmented 
matrices of two systems of linear equations, then the two systems have the 
same solution sets - a solution of the one system is a solution of the other. 
For example the systems whose augmented matrices are A and B in the 
above example are respectively 

( x + 2y = 0 ( 2x + Ay = 0 

< 2 x + y = 1 and < x — y = 2 

y x — y = 2 [ 4x — y = 5 

and these systems have precisely the same solutions. 

1.3 The Gauss— Jordan algorithm 

We now describe the GAUSS-JORDAN ALGORITHM. This is a process 
which starts with a given matrix A and produces a matrix B in reduced row- 
echelon form, which is row-equivalent to A. If A is the augmented matrix 
of a system of linear equations, then B will be a much simpler matrix than 
A from which the consistency or inconsistency of the corresponding system 
is immediately apparent and in fact the complete solution of the system can 
be read off. 

STEP 1 . 

Find the first non-zero column moving from left to right, (column c{) 
and select a non-zero entry from this column. By interchanging rows, if 
necessary, ensure that the first entry in this column is non-zero. Multiply 
row 1 by the multiplicative inverse of ai Cl thereby converting ai Cl to 1. For 
each non-zero element a* Cl , i > 1, (if any) in column ci, add — ai Cl times 
row 1 to row i, thereby ensuring that all elements in column c\, apart from 
the first, are zero. 

STEP 2 . If the matrix obtained at Step 1 has its 2nd, . . . , mth rows all 
zero, the matrix is in reduced row-echelon form. Otherwise suppose that 
the first column which has a non-zero element in the rows below the first is 
column C 2 • Then ci < C 2 . By interchanging rows below the first, if necessary, 
ensure that a 2 C2 is non-zero. Then convert a 2 C2 to 1 and by adding suitable 
multiples of row 2 to the remaing rows, where necessary, ensure that all 
remaining elements in column C 2 are zero. 
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The process is repeated and will eventually stop after r steps, either 
because we run out of rows, or because we run out of non-zero columns. In 
general, the final matrix will be in reduced row-echelon form and will have 
r non-zero rows, with leading entries 1 in columns ci, . . . , c r , respectively. 

EXAMPLE 1.3.1 


' 0 

0 

4 

0 ' 


' 2 

2 

-2 

5 ' 

2 

2 

-2 

5 

R.\ <— » R 2 

0 

0 

4 

0 

5 

5 

-1 

5 


5 

5 

-1 

5 


R\ — > i R\ 


R 2 —> \R2 


1 

1 

-1 

5 

2 






' 1 

1 

-1 

5 1 

2 

0 

0 

4 

0 



r 3 - 

- R :>, 

- 5Ri 

0 

0 

4 

0 

5 

5 

-1 

5 

_ 





0 

0 

4 ■ 

15 

2 - 

1 

1 

-1 



1 1 






' 1 1 

0 

0 

0 

1 


z 

0 

{ 

R\ - 

-9 JlXI + R 2 


0 0 

1 

0 

0 

4 

— 

15 

9 

\ 

Ih - 

-> R 3 — IR 2 


0 0 

0 - 


R3 —> j § R-i 

' 1 1 0 § ' 
0010 

Ri R \ — 

'110 0' 
0 0 10 


0001 


0 0 0 1 


5 

2 

0 

15 

2 


The last matrix is in reduced row-echelon form. 


REMARK 1.3.1 It is possible to show that a given matrix over an ar- 
bitrary field is row-equivalent to precisely one matrix which is in reduced 
row-echelon form. 

A flow-chart for the Gauss- Jordan algorithm, based on [1, page 83] is pre- 
sented in figure 1.1 below. 


1.4 Systematic solution of linear systems. 

Suppose a system of m linear equations in n unknowns aq, • • • , x n has aug- 
mented matrix A and that A is row-equivalent to a matrix B which is in 
reduced row-echelon form, via the Gauss-Jordan algorithm. Then A and B 
are m x (n + 1). Suppose that B has r non-zero rows and that the leading 
entry 1 in row i occurs in column number Cj, for 1 < i < r. Then 

l<ci<C 2 < • • ■ , < c r < n + 1 . 
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START 

I 

Input A, m, n 

\ 

i = 1, j = 1 


Are the elements in the 
jth column on and below 
the ith row all zero? 


3=3 + 1 


Let a p j be the first non-zero 
element in column j on or 
below the ith row 


Is j = n? 
Yes 


Is p = i? 
Yes 


Interchange the 
pth and ith rows 


Divide the ith row by aij 



Subtract a q j 
row from the 
for q = 1 , . . 

times the ith 
gth row for 
• • m (q -/ i) 



Set c 

i = 3 



1— 1 

W 

<s>. 

= ml 

No^ 


= n? 

Yes 


Print A, 

Cl , . . . , Cj 


STOP 


Figure 1.1: Gauss- Jordan algorithm. 
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Also assume that the remaining column numbers are c r + 1 , • • • , c n +i, where 
1 < c r+ 1 < c r+ 2 < • • • < c n < n + 1. 

Case 1: c r = n + 1. The system is inconsistent. For the last non-zero 
row of B is [0, 0, • • • , 1] and the corresponding equation is 

O.xi + 0 x 2 H b 0x n = 1, 

which has no solutions. Consequently the original system has no solutions. 

Case 2: c r < n. The system of equations corresponding to the non-zero 
rows of B is consistent. First notice that r < n here. 

If r = n , then c\ = 1, C 2 = 2, • • ■ , c n = n and 

'1 0 • • • 0 ri, 

0 1 • • • 0 d 2 

B = 0 0 • • • 1 d n • 

0 0 ••• 0 0 

_ 0 0 • • • 0 0 

There is a unique solution x\ = d\, x 2 = d 2 , ■ ■ ■ , x n = d n . 

If r < n, there will be more than one solution (infinitely many if the 
field is infinite). For all solutions are obtained by taking the unknowns 
x ci , • • • , x Cr as dependent unknowns and using the r equations correspond- 
ing to the non-zero rows of B to express these unknowns in terms of the 
remaining independent unknowns x Cr+1 , ..., x Cn , which can take on arbi- 
trary values: 

X C1 = frln+l — bic r+ i x c r +i — • • • — bic n X Cn 

Xc r — ^rn +1 b rCr + 1 x Cr _^i ■ b rCn x Cri . 

In particular, taking x Cr+1 = 0, . . . , x Cn _ 1 = 0 and x Cn = 0,1 respectively, 
produces at least two solutions. 

EXAMPLE 1.4.1 Solve the system 

x + y = 0 

x — y = 1 

4x + 2 y = 1. 
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Solution. The augmented matrix of the system is 

'1 10 ' 

A = 1-11 

_ 4 2 1 _ 

which is row equivalent to 

'io y 

B= 0 1 — i 

_oo 6 _ 

We read off the unique solution x = 3 , y = — \ • 

(Here n = 2, r = 2, c\ = 1, C 2 = 2. Also c r = C2 = 2<3 = n+ l and 
r = n.) 

EXAMPLE 1.4.2 Solve the system 

2x\ + 2 x 2 ~ 2 x 3 = 5 

7xi + 7x 2 + x 3 = 10 
5xi + 5 x 2 — x 3 = 5. 

Solution. The augmented matrix is 

' 2 2 -2 5 ' 

A = 7 7 1 10 

_ 5 5 -1 5 _ 

which is row equivalent to 

'110 0 ' 

B= 0010. 

_ 0 0 0 1 _ 

We read off inconsistency for the original system. 

(Here n = 3, r = 3, c 3 = 1, C 2 = 3. Also c r = c 3 = 4 = n + 1.) 

EXAMPLE 1.4.3 Solve the system 


xi - x 2 + x 3 
xi + x 2 - x 3 


1 

2. 
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Solution. The augmented matrix is 


A 


1-1 11 
1 1-12 


which is row equivalent to 


B = 


10 0 | 

0 1 -1 I 


The complete solution is x\ = §, X 2 = ^ + X 3 , with X 3 arbitrary. 

(Here n = 3, r = 2, ci = 1, ci = 2. Also c r = C 2 = 2 < 4 = n + 1 and 
r < n.) 


EXAMPLE 1.4.4 Solve the system 


6x3 + 2x4 — 4x5 — 8x6 = 8 

3x3 + X4 — 2x5 — 4 x 6 = 4 
2xi — 3x2 + X3 + 4x4 ~ 7x5 + xq = 2 

6X1—9X2 + 11X4 — 19X5 + 3X6 = 1. 


Solution. The augmented 

matrix is 






' 0 

0 

6 

2 

-4 

-8 

8 ' 

A = 

0 

0 

3 

1 

-2 

-4 

4 

2 

-3 

1 

4 

-7 

1 

2 


6 

-9 

0 

11 - 

-19 

3 

1 _ 

which is row equivalent to 








' 1 

3 

2 

0 

11 

6, 

19 
" G 

0 

1 -| 

24 

B = 

0 

0 

1 

1 

2 

0 

5 

3 

3 

1 

4 

0 

0 

0 

0 

0 

1 


0 

0 

0 

0 

0 

0 

0 _ 

The complete solution is 







Xl = 

k + i x 2 

11 

6 

: X4 + 

f*5 

5 


x 3 = 

5 

3 

3 X 4 + |x 5 , 





X 6 = 4, 

with X2, X4, X5 arbitrary. 

(Here n = 6 , r = 3, ci = 1, C 2 = 3, C 3 = 6 ; c r = C 3 = 6 < 7 = n + 1; r < n.) 
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EXAMPLE 1.4.5 Find the rational number t for which the following sys- 
tem is consistent and solve the system for this value of t. 

x + y = 2 

x — y = 0 

3 x — y = t. 

Solution. The augmented matrix of the system is 

'1 12 " 

A = 1-10 

_3 -1 t _ 

which is row-equivalent to the simpler matrix 

"11 2 ' 

B= 01 1 . 

_ 0 0 t- 2 

Hence if t ^ 2 the system is inconsistent. If t = 2 the system is consistent 
and 



' 1 

1 

2 ' 


' 1 

0 

1 ' 

B = 

0 

1 

1 

-> 

0 

1 

1 


0 

0 

0 


0 

0 

0 


We read off the solution x = 1, y = 1. 

EXAMPLE 1.4.6 For which rationals a and b does the following system 
have (i) no solution, (ii) a unique solution, (iii) infinitely many solutions? 

x — Ty + 3z = 4 
2x — 3 y + az = 5 
3x — 4y + 5z = b. 

Solution. The augmented matrix of the system is 

" 1 -2 3 4 ' 

A= 2 -3 a 5 
3—456 
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f i ?2 ► R‘2 ~ 2R\ 

\ Rs — >• — 3Ri 


R3 — 1 " R3 — 


1 

0 

0 


1-2 3 4 

0 1 a - 6 -3 

0 2 —4 b — 12 


-2 3 4 

1 a - 6 -3 

0 —2a + 8 6 — 6 


= B. 


Case 1. a / 4. Then 
a matrix of the form 


—2a + 8/0 and we see that B can be reduced to 


1 0 0 
0 1 0 
0 0 1 


u 

V 

b - 6 

-2a+8 - 


and we have the unique solution x = u, y = v,z = (b — 6 ) / ( — 2 a + 8 ). 
Case 2. a = 4. Then 


B = 


1-2 3 4 

0 1-2 -3 
000 6-6 


If 6 / 6 we get no solution, whereas if 6 = 6 then 


B = 


1-234 
0 1-2-3 

0 0 0 0 


-Rl — > R\ + 2i?2 


read off the complete solution x = — 2 + z, y = 


10 - 1-2 
0 1-2-3 
0 0 0 0 


We 


—3 + 2z, with z arbitrary. 


EXAMPLE 1.4.7 Find the reduced row-echelon form of the following ma- 
trix over Z 3 : 

' 2121 ' 

2 2 10' 

Hence solve the system 


2x + y + 2z = 1 

2x + 2y + z = 0 


over Z 3 . 


Solution. 
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2 1 2 
2 2 1 

R± — > 2Ri 


1 

0 

' 1 
0 


R'2 R'2 — Ri 


2 1 2 

0 1 -1 


2 1 2 
1 2 2 


i?l — > Ri + i?2 


1 

-1 

1 0 0 

0 1 2 


2 1 
0 1 

1 ' 

2 ' 


2 1 
2 2 


The last matrix is in reduced row-echelon form. 

To solve the system of equations whose augmented matrix is the given 
matrix over Z3, we see from the reduced row-echelon form that x = 1 and 
y = 2 — 2 z = 2 + z, where z = 0 , 1 , 2 . Hence there are three solutions 
to the given system of linear equations: (x, y, z ) = (1, 2, 0), (1, 0, 1) and 
( 1 , 1 , 2 ). 


1.5 Homogeneous systems 

A system of homogeneous linear equations is a system of the form 

CT 1 1 1 T 012*^2 “k ' ' ' a\ n^n — 6 

0212-1 T 022212 + • • • + d2nXn = 0 


a>m i%i T a m 2 x 2 T ■ ■ ■ T a mn x n — 0 . 

Such a system is always consistent as x\ = 0 , • • • , x n = 0 is a solution. 
This solution is called the trivial solution. Any other solution is called a 
non-trivial solution. 

For example the homogeneous system 

x — y = 0 
x + y = 0 

has only the trivial solution, whereas the homogeneous system 

x — y + z = 0 
x+y+z = 0 

has the complete solution x = —z, y = 0 , z arbitrary. In particular, taking 
z = 1 gives the non-trivial solution x = — 1, y = 0, z = 1 . 

There is simple but fundamental theorem concerning homogeneous sys- 
tems. 

THEOREM 1.5.1 A homogeneous system of m linear equations in n un- 
knowns always has a non-trivial solution if m < n. 
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Proof. Suppose that m < n and that the coefficient matrix of the system 
is row-equivalent to B, a matrix in reduced row-echelon form. Let r be the 
number of non-zero rows in B. Then r < m < n and hence n — r > 0 and 
so the number n — r of arbitrary unknowns is in fact positive. Taking one 
of these unknowns to be 1 gives a non-trivial solution. 

REMARK 1.5.1 Let two systems of homogeneous equations in n un- 
knowns have coefficient matrices A and B, respectively. If each row of B is 
a linear combination of the rows of A (i.e. a sum of multiples of the rows 
of A) and each row of A is a linear combination of the rows of B, then it is 
easy to prove that the two systems have identical solutions. The converse is 
true, but is not easy to prove. Similarly if A and B have the same reduced 
row-echelon form, apart from possibly zero rows, then the two systems have 
identical solutions and conversely. 

There is a similar situation in the case of two systems of linear equations 
(not necessarily homogeneous), with the proviso that in the statement of 
the converse, the extra condition that both the systems are consistent, is 
needed. 


1.6 PROBLEMS 


1. Which of the following matrices of rationals is in reduced row-echelon 
form? 


(a) 


(d) 


' 1 

0 

0 

0 

-3 ' 


' 0 

1 

0 

0 

5 ' 


' 0 

1 

0 

0 ' 

0 

0 

1 

0 

4 

(b) 

0 

0 

1 

0 

-4 

(c) 

0 

0 

1 

0 

0 

0 

0 

1 

2 


0 

0 

0 

-1 

3 


0 

1 

0 

-2 


' 0 

1 

0 

0 

2 ' 


" 1 

2 

0 

0 

0 ' 


' 0 

0 

0 

0 ' 

0 

0 

0 

0 

-1 

(e) 

0 

0 

1 

0 

0 

(f) 

0 

0 

1 

2 

0 

0 

0 

1 

4 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 _ 


0 

0 

0 

0 

0 _ 


0 

0 

0 

0 _ 


(g) 


1 0 0 0 1 

0 10 0 2 
0001-1 
0 0 0 0 0 


[Answers: (a), (e), (g)] 


2. Find reduced row-echelon forms which are row-equivalent to the following 
matrices: 






” 1 1 1 ” 


to 

o 

o 

'0 0 0' 
2 4 0 

(b) 

'013' 
12 4 

(c) 

1 1 0 

(d) 

0 0 0 





_ 1 0 0 _ 


-4 0 0 _ 
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[Answers: 







'10 0' 


'10 0' 

(a) 

'12 0' 
0 0 0 

(b) 

' 1 0 -2 ' 
0 1 3 

(c) 

0 1 0 

(d) 

0 0 0 






_ 0 0 ! _ 


0 

0 

0 


3 . Solve the following systems of linear equations by reducing the augmented 
matrix to reduced row-echelon form: 


(a) x + y + z 

Tx + 3 y — z 
x — y — z 


2 (6) X\ + X2 — X3 + 2X4 = 10 

8 3 x - i — X2 + 7.T3 + 4.T4 = 1 

—8 — 5 xi + 3x2 — 15 x 3 — 6x4 = 9 


(c) 3 x — y + 7 z 
2 x — y + 4 z 
x — y + z 
6x — 4 y + 10 z 


0 (d) 

1 

2 

1 2xi 

3 


2X2 + 3X3 “ 4x4 
2 x 3 + 3X4 
+ 2 x 2 - 5 x 3 + 2 x 4 

2xi — 6x3 + 9x4 


1 

4 

4 

7 


[Answers: (a) x = — 3 , y = 2 = (b) inconsistent; 

(c) x = — \ — 3 z, y = — | — 2 z, with z arbitrary; 

(d) xi = ^ — 9x4, X2 = — § + xyX4, X3 = 2 — |x4, with X4 arbitrary.] 


4 . Show that the following system is consistent if and only if c = 2 a — 36 
and solve the system in this case. 


2 x — y + 3 z = a 
3 x + y — 5 z = b 
— 5 x — 5 y + 21 z = c. 


[Answer: x=^ + | z, y = 3a + 2b + ^-z, with z arbitrary.] 

5 . Find the value of t for which the following system is consistent and solve 
the system for this value of t. 


x + y = 1 

tx + y = t 
(1 + t)x + 2 y = 3 . 


[Answer: t = 2 ; x = 1 , y = 0 .] 
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6. Solve the homogeneous system 

— 3xi + X 2 + X 3 + X 4 = 0 

X\ — 3X2 + X 3 + X 4 = 0 

X\ + X2 — 3X3 + = 0 

Xi + X2 + X 3 — 3X4 = 0 . 

[Answer: xi = X2 = X3 = X4, with X4 arbitrary.] 

7 . For which rational numbers A does the homogeneous system 

x + (A — 3 )y = 0 
(A — 3 )x + y = 0 

have a non-trivial solution? 

[Answer: A = 2 , 4 .] 

8. Solve the homogeneous system 

3 xi + x 2 + x 3 + X 4 = 0 

5xi — X2 + X3 — X4 = 0. 

[Answer: xi = — \x%, X2 = —5X3 — X4, with X3 and X4 arbitrary.] 

9 . Let A be the coefficient matrix of the following homogeneous system of 
n equations in n unknowns: 

(1 - n)xi + x 2 H b x n = 0 

xi + (1 — n)x 2 H b x n = 0 

... = 0 

xi+x 2 H b (1 - n)x n = 0. 

Find the reduced row-echelon form of A and hence, or otherwise, prove that 
the solution of the above system is xi = x 2 = • • • = x n , with x n arbitrary. 

10 . Let A = a b a matrix over a field F . Prove that A is row- 

c a 

equivalent to J ^ if ad — be 7b 0, but is row-equivalent to a matrix 
whose second row is zero, if ad — be = 0. 
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11. For which rational numbers a does the following system have (i) no 
solutions (ii) exactly one solution (iii) infinitely many solutions? 

x + 2y — 3z = 4 
3x — y + 5z = 2 

4x + y + (a 2 — 14)z = a + 2. 

[Answer: a = —4, no solution; a = 4, infinitely many solutions; a ^ ±4, 
exactly one solution.] 

12 . Solve the following system of homogeneous equations over Z 2 - 

xi + x 3 + x 5 = 0 

X 2 + x 4 + x 5 = 0 

X1+X2 + X3 + X4 = 0 

x 3 + x 4 = 0 . 


[Answer: x\ = X 2 = X 4 + X 5 , X 3 = X 4 , with 24 and 25 arbitrary elements of 

z 2 .] 

13. Solve the following systems of linear equations over Z 5 : 

(a) 2x + y + 3z = 4 ( b ) 2x + y + 3z = 4 

4x + y + 4z = 1 4x + y + 4z = 1 

32 + y + 2z = 0 x + y = 3. 

[Answer: (a) 2 = 1, y = 2, z = 0; (b) 2 = 1 + 2z, y = 2 + 3z, with z an 
arbitrary element of Z 5 .] 

14. If (ai, . . . , a n ) and (/3i, . . . , /3 n ) are solutions of a system of linear equa- 
tions, prove that 


((1 - t)ai + t(3 1 , . . . , (1 - t)a n + tp n ) 


is also a solution. 

15. If (oq, . . . , a n ) is a solution of a system of linear equations, prove that 
the complete solution is given by 21 = cti + yi, ■ ■ ■ , x n = a n + y n , where 
(?/i , . . . , y n ) is the general solution of the associated homogeneous system. 
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16. Find the values of a and b for which the following system is consistent. 
Also find the complete solution when a = 6 = 2. 

x + y — z + w = 1 
ax + y + z + w = b 
3x + 2y + aw = 1 + a. 

[Answer: a/2ora = 2 = 6;x=l — 2 z, y = 3z — w, with z, w arbitrary.] 

17. Let F = {0, 1, a, 6} be a field consisting of 4 elements. 

(a) Determine the addition and multiplication tables of F. (Hint: Prove 
that the elements 1 + 0, 1 + 1, 1 + a, 1 + 6 are distinct and deduce that 
1 + 1 + 1 + 1 = 0; then deduce that 1 + 1 = 0.) 

(b) A matrix A, whose elements belong to F. is defined by 

1 a 6 a 

A = a 6 6 1 , 

_1 1 1 a _ 

prove that the reduced row-echelon form of A is given by the matrix 

' 1 0 0 0 ' 

B = 0 10 6 . 

0 0 11 
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MATRICES 


2.1 Matrix arithmetic 

A matrix over a field F is a rectangular array of elements from F. The sym- 
bol M mxn (F ) denotes the collection of all m x n matrices over F. Matrices 
will usually be denoted by capital letters and the equation A = [a^] means 
that the element in the i— th row and j— th column of the matrix A equals 
aij. It is also occasionally convenient to write = (A)ij. For the present, 
all matrices will have rational entries, unless otherwise stated. 

EXAMPLE 2.1.1 The formula = l/(i + j) for 1<*<3, 1 <j< 4 
defines a 3 x 4 matrix A = [a t j\ ■ namely 

1 l l i-i 

2 3 4 5 

A — 111! 

^ 3 4 5 6 ' 

fill 

4 5 6 7 -1 

DEFINITION 2.1.1 (Equality of matrices) Matrices A and B are said 
to be equal if A and B have the same size and corresponding elements are 
equal; that is A and B € M mxn (F ) and A = [(%•], B = [b tJ \. with a t j = bij 
for 1 < i < m, 1 < j < n. 

DEFINITION 2.1.2 (Addition of matrices) Let A = [aij] and B = 
[bij] be of the same size. Then A + B is the matrix obtained by adding 
corresponding elements of A and B; that is 

A + B = + [bij\ = [a^ + bij]. 
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DEFINITION 2.1.3 (Scalar multiple of a matrix) Let A = [a t j] and 
t G F (that is t is a scalar ). Then tA is the matrix obtained by multiplying 
all elements of A by t; that is 

tA = t[aij } = [ taij\ . 

DEFINITION 2.1.4 (Additive inverse of a matrix) Let A = [a^] . 
Then —A is the matrix obtained by replacing the elements of A by their 
additive inverses; that is 


A — [ aij\ — [ aij\. 

DEFINITION 2.1.5 (Subtraction of matrices) Matrix subtraction is 
defined for two matrices A = [aij] and B = [ bij ] of the same size, in the 
usual way; that is 

A — B = [ aij \ — [bij] = [aij — bij\. 

DEFINITION 2.1.6 (The zero matrix) For each m, n the matrix in 
M m xn{B), all of whose elements are zero, is called the zero matrix (of size 
m x n) and is denoted by the symbol 0. 

The matrix operations of addition, scalar multiplication, additive inverse 
and subtraction satisfy the usual laws of arithmetic. (In what follows, s and 
t will be arbitrary scalars and A, B , C are matrices of the same size.) 

1. (A + B) + C = A + {B + C)- 

2. A + B = B + A; 

3. 0 + A = A; 

4. A + (-A) =0; 

5. (s + t)A = sA + tA, ( s — t)A = sA — tA ; 

6. t(A + B) = tA + tB, t(A — B) = t A — tB; 

7. s(tA) = ( si) A ; 

8.1 A = A, 0A = 0, (-1 )A = -A; 

9. tA = 0 =4- t = 0 or A = 0. 

Other similar properties will be used when needed. 
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DEFINITION 2.1.7 (Matrix product) Let A = [a v j\ be a matrix of 
size m x n and B = [bjk] be a matrix of size n x p; (that is the number 
of columns of A equals the number of rows of B). Then AB is the m x p 
matrix C = [c t k] whose ( i , k )— th element is defined by the formula 

n 

C-ik — ^ ' dijbjk — dubik + ' ' ' + a in b n k. 

3 = 1 

EXAMPLE 2.1.2 


'12' 

'56' 


Ix5+2x7 Ix6+2x8 

3 4 

_ 7 8 _ 


3x 5 + 4 x 7 3x6 + 4x8 

'56' 

_ 7 8 _ 

'12' 
3 4 

= 

'23 34 ' 
3! 46 


'12' 
3 4 

'56' 
_ 7 8 _ 


3. 


3 4 


3 4 
6 8 


4. [ 3 



; 11 ] ; 


L 

J 



' 1 -1 ' 


' 1 -1 ' 


r o oi 

1 -1 


1 -1 


o 

o 


Matrix multiplication obeys many of the familiar laws of arithmetic apart 
from the commutative law. 


1. ( AB)C = A(BC) if A, B , C are m x n, n x p, p x q, respectively; 


2. t(AB) = (tA)B = A(tB), A(-B) = (- A)B = -(AB); 

3. ( A + B)C = AC + BC if A and B are m x n and C is n x p; 

4. D(A + B) = DA + DB if A and B are m x n and D is p x m. 

We prove the associative law only: 

First observe that ( AB)C and A(BC) are both of size m x q. 

Let A = [aij\, B = [bjk], C = [ c k i ]. Then 

p p / n \ 

{{AB)C) -i = 'y ] (AB)jkCki = EE Q'ijbjk J Ckl 
k= 1 k= 1 \J=1 J 

p n 

= xx Q*ij bj k Ckl • 

k = 1 j = 1 
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Similarly 

n p 

(yBC)) u = EE Q'ijbjkCkl • 

j= 1 k= 1 

However the double summations are equal. For sums of the form 

n p p n 

J2J2 d i k and SZX* 

j= 1 k=l k= 1 j=l 

represent the sum of the np elements of the rectangular array [djk ] , by rows 
and by columns, respectively. Consequently 

((AB)C) a = ( A(BC)) u 

for 1 < i < m, I < l < q. Hence ( AB)C = A(BC). 

The system of m linear equations in n unknowns 
^llAl T ^12^2 T * * ' T Oi n X n — 

0-21^1 + 0-22X2 + ' ' ' + 02nX n = &2 

Om.\X\ + O m 2X2 T ' ' ' T O mn X n = b m 
is equivalent to a single matrix equation 


an o\2 

021 022 

Oln 

■ ■ 02 n 


X\ 

X2 


' bi ' 

62 

Oml O m 2 

Omn 


Xn 


bm 


that is AX = B, where A = [ay] is the coefficient matrix of the system, 


X\ 


' h ~ 


X2 

is the vector of unknowns and B = 

b 2 

is the vector of 

X n 


bm 



constants. 

Another useful matrix equation equivalent to the above system of linear 
equations is 



an 


012 



Uln 


' 61 ' 

X] 

021 

+ X 2 

022 


+ x n 

n 

= 

b 2 


Oml 


Om2 



^mri 


bm 
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EXAMPLE 2.1.3 The system 

x + y + z = 1 

x — y + z = 0. 

is equivalent to the matrix equation 

'1 11 
1 -1 1 

and to the equation 



' 1 ' 


l ' 


' 1 ' 


' 1 ' 

X 

1 

Ay 

-l 

+ z 

1 

— 

0 


x 

y 

z 


1 

0 


2.2 Linear transformations 

An n— dimensional column vector is an n x 1 matrix over F. The collection 
of all ?r-dimensional column vectors is denoted by F n . 

Every matrix is associated with an important type of function called a 
linear transformation. 

DEFINITION 2.2.1 (Linear transformation) With A € M mxn (F), we 
associate the function T A : F n —> F m defined by T a (X) = AX for all 


X € F n . 
form 

More explicitly, 

using components, 

the above function takes the 


m = 

auxi + a\2X2 + • • 

+ &\n%n 


y2 = 

®2lAl + «22®2 + ' ' 

+ &2n%n 


ym = 

®mi®i T a m 2 X 2 T ■ 

* * 1 0'mn%ri') 


where y\. y 2 , ■ ■ ■ , y m are the components of the column vector Ta{X). 

The function just defined has the property that 

T a (sX + tY) = sT a (X) + tT A (Y ) (2.1) 

for all s, t G F and all n-dinrensional column vectors X, Y . For 


T a (sX + tY) = A(sX + tY) = s{AX) + t(AY) = sT a (X ) + tT A (Y). 
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REMARK 2.2.1 It is easy to prove that if T : F n —> F m is a function 
satisfying equation 2.1, then T = Ta-, where A is the m x n matrix whose 
columns are T(E \), . . . , T(E n ), respectively, where E\, ... ,E n are the Tr- 
idimensional unit vectors defined by 



" 1 ' 


' 0 ' 


0 


0 

Ei = 

0 

, ... , E n — 

1 


One well-known example of a linear transformation arises from rotating 
the (x, T/)-plane in 2-dinrensional Euclidean space, anticlockwise through 8 
radians. Here a point (x, y ) will be transformed into the point (xi, yi), 
where 


xi = x cos 8 — y sin 8 
yi = xsind + ycosd. 

In 3-dinrensional Euclidean space, the equations 

xi = x cos 8 — y sin 6 , y\ = x sin 8 + y cos 8 , z\ = z; 
xi = x, y i = y cos cp — z sin (p, z\ = y sin < p + z cos <p\ 

xi = x cos ip — z sin ip, y\ = y, z\ = x sin ip + z cos ip; 

correspond to rotations about the positive z, x, y- axes, anticlockwise through 
8, (p, ip radians, respectively. 

The product of two matrices is related to the product of the correspond- 
ing linear transformations: 

If A is m x n and B is n xp, then the function T XT b '■ F p — > F m , obtained 
by first performing Tg, then Ta is in fact equal to the linear transformation 
Tab- For if X e F p , we have 

T a Tb(X) = A(BX) = ( AB)X = T AB (X). 

The following example is useful for producing rotations in 3-dinrensional 
animated design. (See [27, pages 97-112].) 

EXAMPLE 2.2.1 The linear transformation resulting from successively 
rotating 3-dinrensional space about the positive z, x, y- axes, anticlockwise 
through 8, (p, ip radians respectively, is equal to Tabc-, where 
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(xi,yi) 


Figure 2.1: Reflection in a line. 



cos 9 

— sin 9 

0 ' 


' 1 

0 

0 

c = 

sin 9 

cos 9 

0 

, B = 

0 

cos (p 

— sin <p 


0 

0 

1 


0 

sin <p 

cos cp 


A 


cos ip 0 — sin ip 

0 1 0 

sin ip 0 cos ip 


The matrix ABC is quite complicated: 



cos ip 

0 

— sin ip 


cos 9 

— sin 9 

0 

A(BC) = 

0 

1 

0 


cos cp sin 9 

cos (p cos 9 

— sin i p 


sin ip 

0 

cos ip 


sin cp sin 9 

sin ( p cos 9 

cos cp 


cos ip cos 9 — sin ip sin cp sin 9 — cos ip sin 9 — sin ip sin (p sin 9 — sin ip cos (p 

cos i p sin 9 cos <p cos 9 — sin cp 

sin ip cos 9 + cos ip sin (p sin 9 — sin ip sin 9 + cos ip sin (p cos 9 cos ip cos <p 


EXAMPLE 2.2.2 Another example of a linear transformation arising from 
geometry is reflection of the plane in a line l inclined at an angle 9 to the 
positive x-axis. 

We reduce the problem to the simpler case 9 = 0, where the equations 
of transformation are x\ = x, y\ = —y. First rotate the plane clockwise 
through 9 radians, thereby taking l into the 2 -axis; next reflect the plane in 
the 2 -axis; then rotate the plane anticlockwise through 9 radians, thereby 
restoring l to its original position. 
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In terms of matrices, we get transformation equations 


Xi 


cos 6 

— sin 9 


' 1 

0 ' 


cos (- 

-9) 

— sin (—9) 


X 

. yi . 


sin 9 

cos 9 


0 

-1 


sin (- 

-9) 

cos (—9) 


_ y . 



cos 9 

sin 9 


cos 9 

sin 9 


X 





sin 9 

— cos 9 


— sin 9 

cos 9 


_ y _ 




cos 29 sin 29 


X 

sin 29 — cos 29 


. y . 


The more general transformation 


Xl 


cos 9 

— sin 9 


X 


u 

= a 

sin 9 




+ 


. 2/i . 


cos 6 


. y _ 


V 


represents a rotation, followed by a scaling and then by a translation. Such 
transformations are important in computer graphics. See [23, 24]. 


EXAMPLE 2.2.3 Our last example of a geometrical linear transformation 
arises from projecting the plane onto a line l through the origin, inclined 
at angle 9 to the positive x-axis. Again we reduce that problem to the 
simpler case where l is the x-axis and the equations of transformation are 
xi = x ,yi = 0. 

In terms of matrices, we get transformation equations 


X\ 


cos 9 

— sin 9 


. 2/1 . 


sin 9 

cos 9 



cos (—9) 
sin (—9) 


— sin (— 9) 
cos (—9) 
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cos 6 0 


cos 6 sin 9 


X 

sin 6 0 


— sin 8 cos 9 


. y . 


cos 2 9 cos 9 sin 9 


X 

sin 9 cos 9 sin 2 9 


. y . 


2.3 Recurrence relations 


DEFINITION 2.3.1 (The identity matrix) The n x n matrix I n = 
[<5jj], defined by 8^ = 1 if i = j, 8^ = 0 if i / j, is called the n x n identity 
matrix of order n. In other words, the columns of the identity matrix of 
order n are the unit vectors Ei, ■ ■ ■ , E n , respectively. 

For example, Ii= q ^ • 

THEOREM 2.3.1 If A is m x n, then I m A = A = AI n . 


DEFINITION 2.3.2 (k— th power of a matrix) If A is an nxn matrix, 
we define A k recursively as follows: A 0 = I n and A k+1 = A k A for k > 0. 

For example A 1 = A 0 A = I n A = A and hence A 2 = A 1 A = A A. 

The usual index laws hold provided AB = BA: 


1. A m A n = A m+n , ( A m ) n = A 

2. {AB) n = A n B n ; 

3. A m B n = B n A m ] 

4. (A + B) 2 = A 2 + 2 AB + B 2 ; 


mn. 


5. {A + B) n = ^2( n .)A i B 

i = 0 


6. (A + B)(A-B) = A 2 -B 2 . 


We now state a basic property of the natural numbers. 

AXIOM 2.3.1 (PRINCIPLE OF MATHEMATICAL INDUCTION) 

If for each n > 1, V n denotes a mathematical statement and 


(i) V\ is true, 



32 


CHAPTER 2. MATRICES 


(ii) the truth ofV n implies that ofP n +\ for each n > 1, 
then V n is true for all n > 1. 


EXAMPLE 2.3.1 Let A = 


A n 


1 + 6 n 
—9 n 


4 

-5 


. Prove that 


4 n 

1 — 6n 


if n > 1. 


Solution. We use the principle of mathematical induction. 
Take V n to be the statement 


A n 


1 + 6 n 4 n 
—9 n 1 — 6 n 


Then V± asserts that 


' 1 + 6x1 

4x1 


7 

4 ' 

-9 x 1 

1-6x1 


-9 

-5 


which is true. Now let n > 1 and assume that V n is true. We have to deduce 
that 


A n+i 


' l + 6(n + l) 

4(n + 1) 


7 + 6n 4n + 4 

— 9(n + 1) 

1 - 6(n + 1) 


— 9n — 9 — 5 — 6n 


Now 


A n +i 


A n A 
1 + 6 n 
—9 n 


4 n 

1 — 6 n 


7 4 

-9 -5 


(1 + 6n)7 + (4n)(-9) (1 + 6n)4 + (4n)(-5) 

(— 9n)7 + (1 - 6n)(— 9) (-9n)4 + (1 - 6ra)(-5) 


7 + 6n 4n + 4 
— 9n — 9 — 5 — 6n ’ 


and “the induction goes through”. 

The last example has an application to the solution of a system of re- 
currence relations: 
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EXAMPLE 2.3.2 The following system of recurrence relations holds for 
all n > 0: 


x n+ i = 7x n + 4 y n 

Vn+i = ~9x n - 5 y n . 

Solve the system for x n and y n in terms of xq and yo- 


Solution. Combine the above equations into a single matrix equation 


^n+1 


7 

4 ' 



Un+1 


-9 

-5 


_ Vn _ 


or X n+ i = AX n , where A = 
We see that 


7 4 

-9 -5 


and X n = 


x n 

Vn 


X\ = AXo 

X 2 = AX i = A(AX 0 ) = A 2 X o 


X n = A n X 0 . 


(The truth of the equation X n = A n X o for n > 1, strictly speaking 
follows by mathematical induction; however for simple cases such as the 
above, it is customary to omit the strict proof and supply instead a few 
lines of motivation for the inductive statement.) 

Hence the previous example gives 


Vn 


= x„ 


1 + 6n 
—9 n 


4 n 

1 — 6 n 



Xo 


. y® . 


(1 + 6n)x 0 + (4n)y 0 
(— 9n)x 0 + (1 - 6 n)y 0 \ ’ 


and hence x n = (l + 6n).xo + 4n?/o and y n = (— 9n)xo + (l —6n)yo, for n> 1. 


2.4 PROBLEMS 

1. Let A, B, C, D be matrices defined by 


i 

CO 

1 

o 


1 

5 

to 

1 

-1 

2 

II 

cq 

-1 

1 

0 

1 

1 _ 


1 

1 

1 

1 

CO 
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C = 


-3 -1 
2 1 
4 3 


D = 


4 -1 
2 0 


Which of the following matrices are defined? Compute those matrices 
which are defined. 

A + B, A + C, AB, BA, CD, DC, D 2 . 

[Answers: A + C, BA, CD, D 2 \ 


1 

0 

-1 ' 


1 

3 

1 

kO 

1 




0 12 
-4 2 

-10 5 


-14 3 

10 -2 
22 -4 



[14 -4 1 

1 

CN 

1 

00 


2. Let A = 
then 


-1 0 1 

0 1 1 


Show that if B is a 3 x 2 such that AB = I 2 , 


B = 


a b 

—a — 1 1 — 6 

a + 1 6 


for suitable numbers a and 6. Use the associative law to show that 
(. BA) 2 B = B. 


3. If A = 


a b 
c d 


4. If A = 


4 -3 
1 0 

induction, to prove that 


prove that A 2 — (a + d)A + (ad — 6c) I 2 = 0. 

, use the fact A 2 = 4 A — 3/2 and mathematical 


if n > 1. 


5. A sequence of numbers x \ , X 2 , ■ ■ ■ , x n , . . . satisfies the recurrence rela- 
tion x n+ i = ax n + bx n -i for n> 1. where a and 6 are constants. Prove 
that 

^n+1 


= A 


71 

1 
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where A = 

If a = 4 and 


a b 
1 0 


and hence express 


%n+ 1 
%n. 


in terms of 


x\ 

Xq 


b = — 3, use the previous question to find a formula for 


x n in terms of x\ and xo- 
[Answer: 


3 n - 1 3 - 3 n 

Xn = X 1 H Xq. 


2 a —a 2 
1 0 


6. Let A = 

(a) Prove that 


A n = 


( n + l)a n 

^ n — 1 


na 


—na n+l 
(1 — n)a n 


if n > 1. 


(b) A sequence xq, xi, . . . , x n , . . . satisfies the recurrence relation x n+ \ 
2 ax n — a 2 x n -i for n > 1. Use part (a) and the previous question 
to prove that x n = na n ~ 1 x \ + (1 — n)a n x o for n > 1. 


7. Let A = 


a b 
c d 


and suppose that Ai and A 2 are the roots of the 
quadratic polynomial x 2 — ( a+d)x+ad—bc . (Ai and A 2 may be equal.) 
Let k n be defined by ko = 0, k\ = 1 and for n > 2 


k n = Y J K~%~ 1 - 

i — 1 


Prove that 

kn + 1 — (Ai H - X 2 )k n \\\ 2 k n — 1, 

if n > 1. Also prove that 

h = f (A? - A5)/(A! - A 2 ) ifA^Aa, 
n \ nA" _1 if Ai = A 2 . 

Use mathematical induction to prove that if n > 1, 

A n = k n A — Xi\2k n -il2, 

[Hint: Use the equation A 2 = (a + d)A — (ad — bc)I 2 -] 




10. Let r > 1 be an integer. Let a and b be arbitrary positive integers. 
Sequences x n and y n of positive integers are defined in terms of a and 
b by the recurrence relations 


x n +i = x n + ry n 
Un+ 1 — x n y n , 


for n > 0, where xq = a and yo = b. 
Use Question 6 to prove that 


Xn i— 

► Vr 

Un 



OO. 


2.5 Non— singular matrices 

DEFINITION 2.5.1 (Non-singular matrix) 

A square matrix A € M nxn (F ) is called non-singular or invertible if 
there exists a matrix B £ M nxn (F) such that 

AB = I n = BA. 

Any matrix B with the above property is called an inverse of A. If A does 
not have an inverse, A is called singular. 
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THEOREM 2.5.1 (Inverses are unique) 

If A has inverses B and C, then B = C. 

Proof. Let B and C be inverses of A. Then AB = I n = BA and AC = 
I n = CA. Then B{AC ) = BI n = B and ( BA)C = I n C = C. Hence because 
B(AC) = ( BA)C , we deduce that B = C. 

REMARK 2.5.1 If A has an inverse, it is denoted by A -1 . So 

AA- 1 = I n = A- 1 A. 

Also if A is non-singular, it follows that A ^ 1 is also non-singular and 

(A " 1 )- 1 = A. 

THEOREM 2.5.2 If A and B are non-singular matrices of the same size, 
then so is AB. Moreover 


{AB)- 1 = B~ 1 A~ 1 . 


Proof. 


{AB){B~ 1 A~ 1 ) = A(BB~ 1 )A~ 1 = A/ n A _1 = AA -1 = I n . 


Similarly 


{B~ 1 A~ 1 ){AB) = I n . 


REMARK 2.5.2 The above result generalizes to a product of m non- 
singular matrices: If Ai, . . . ,A m are non-singular n x n matrices, then the 
product Ai . . . A m is also non-singular. Moreover 


(Ai . ..A m ) 1 = A n l . . . A^. 


(Thus the inverse of the product equals the product of the inverses in the 
reverse order.) 


EXAMPLE 2.5.1 If A and B are n x n matrices satisfying A 2 = B 2 = 
{AB) 2 = I n , prove that AB = BA. 


Solution. Assume A 2 = B 2 = {AB) 2 = I n . Then A, B, AB are non- 
singular and A -1 = A, L > -1 = B, {AB)- 1 = AB. 

But {AB)- 1 = B- 1 A- 1 and hence AB = BA. 
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EXAMPLE 2.5.2 A = 


1 2 
4 8 


is singular. For suppose B 


is an inverse of A. Then the equation AB = 1-2 gives 


r 1 2 1 


1 

-0 


'10' 

00 


c d 


0 i 


a b 
c d 


and equating the corresponding elements of column 1 of both sides gives the 
system 


a + 2c = 1 

4a + 8c = 0 


which is clearly inconsistent. 

THEOREM 2.5.3 Let A = 

non-singular. Also 


a b 
c d 


and A = ad — be / 0. Then A is 


A' 1 = A’ 1 


d -b 
— c a 


REMARK 2.5.3 The expression ad — be is called the determinant of A 
and is denoted by the symbols det A or 1 


Proof. Verify that the matrix B = A 1 
AB = I 2 = BA. 

EXAMPLE 2.5.3 Let 


c d 


d -b 
— c a 


satisfies the equation 


A = 


0 1 0 
0 0 1 
5 0 0 


Verify that A 3 = 5/3, deduce that A is non-singular and find A 1 . 
Solution. After verifying that A 3 = 5/3, we notice that 

A (l A2 )- I ^(l Al ) A 


" = ^ 2 - 


Hence A is non-singular and A 
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THEOREM 2.5.4 If the coefficient matrix 4 of a system of n equations 
in n unknowns is non-singular, then the system AX = B has the unique 
solution X = A~ 1 B. 

Proof. Assume that A exists. 

1. (Uniqueness.) Assume that AX = B. Then 

(A~ 1 A)X = A _1 B, 

I n X = A~ l B, 

X = A~ X B. 
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has the unique solution 



1 r de-bf 1 _ 1 r Ai 1 r Ai/A ' 
A -ce + af A A 2 A 2 /A 

Hence x = Ai/A, y = A 2 /A. 

COROLLARY 2.5.1 The homogeneous system 

ax + by = 0 

cx + riy = 0 

has only the trivial solution if A = ° ^ ^ 0. 

EXAMPLE 2.5.4 The system 

7x + 8y = 100 
2x — 9 y = 10 


has the unique solution x = Ai/A, y = A 2 /A, where 

a 7 8 „ A 100 8 . 7 100 

A = 2 -9 10 —9 =- 980 ' A2 = 2 10 


So x = ^ and y = 


THEOREM 2.5.6 Let A be a square matrix. If A is non-singular, the 
homogeneous system AX = 0 has only the trivial solution. Equivalently, 
if the homogenous system AX = 0 has a non-trivial solution, then A is 
singular. 


Proof. If A is non-singular and AX = 0, then X = A 1 0 = 0. 

REMARK 2.5.4 If A*i, . . . , A* n denote the columns of A, then the equa- 
tion 

AX = aqA*i + . . . + x n A m 

holds. Consequently theorem 2.5.6 tells us that if there exist scalars xi, ... ,x n , 
not all zero, such that 


X\A%\ “t - . . . + x^A-^yi — 0 



2.5. NON-SINGULAR MATRICES 


41 


that is, if the columns of A are linearly dependent, then A is singular. An 
equivalent way of saying that the columns of A are linearly dependent is that 
one of the columns of A is expressible as a sum of certain scalar multiples 
of the remaining columns of A; that is one column is a linear combination 
of the remaining columns. 

EXAMPLE 2.5.5 


A 


1 2 3 
1 0 1 
3 4 7 


is singular. For it can be verified that A has reduced row-echelon form 

'10 1 ' 

0 1 1 
_ 0 0 0 _ 

and consequently AX = 0 has a non-trivial solution x = —1, y = —1, z = 1. 


REMARK 2.5.5 More generally, if A is row-equivalent to a matrix con- 
taining a zero row, then A is singular. For then the homogeneous system 
AX = 0 has a non-trivial solution. 


An important class of non-singular matrices is that of the elementary 
row matrices. 

DEFINITION 2.5.2 (Elementary row matrices) There are three types, 
Eij, Ei(t ), E t j(t), corresponding to the three kinds of elementary row oper- 
ation: 

1. Eij, (i ^ j ) is obtained from the identity matrix I n by interchanging 
rows i and j. 

2. Ei(t), (t / 0) is obtained by multiplying the z-th row of I n by t. 

3. Eij{t), (i / j) is obtained from I n by adding t times the j-th row of 
I n to the i-th row. 


EXAMPLE 2.5.6 (n = 3.) 



'10 0' 


' 1 0 0' 


'10 o' 

E 23 = 

0 0 1 

0 1 0 

, e 2 (- 1) = 

0-10 

0 0 1 

, E 23 ( 1 ) = 

0 1 -1 

0 0 1 
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The elementary row matrices have the following distinguishing property: 

THEOREM 2.5.7 If a matrix A is pre-nrultiplied by an elementary row- 
matrix, the resulting matrix is the one obtained by performing the corre- 
sponding elementary row-operation on A. 

EXAMPLE 2.5.7 



a b 


' 1 

0 

0 ' 


a b 


a b 

E 23 

c d 

= 

0 

0 

1 


c d 

= 

e / 


_ e f _ 


0 

1 

0 


_ e / _ 


c d 


COROLLARY 2.5.2 The three types of elementary row-matrices are non- 
singular. Indeed 

1 - E^ = Eij ; 

2. Er\t) = 

3. ( Eij(t )) 1 = Eij(-t). 


Proof. Taking A = I n in the above theorem, we deduce the following 
equations: 

Ejj Eij — I n 

EiWEiit- 1 ) = In = Eiit-^Eiit) iff ^0 
Eij (t) Eij (-t ) = I n = Eij (-t) Eij (f). 

EXAMPLE 2.5.8 Find the 3x3 matrix A = E 3 (T)E2 3 {2)E\2 explicitly. 
Also find A -1 . 

Solution. 


A — E 3 (5)E 23 (2) 

1 

O 

O 

O O 

1 

= £ 3 ( 5 ) 

1 

0 

0 

to 0 

1 

— 

1 

0 

O 

to 0 

1 


0 0 1 


0 0 1 


0 0 5 I 


To find A 1 , we have 

A- 1 = (E 3 (5)E 23 (2)E 12 r 1 

= E^(E 2 3 (2))- 1 (E 3 (5))- 1 
= Ei2E2 3 (—2)E 3 (5 x ) 
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= Ei2E-23( — 2 ) 


1 0 
0 1 


0 0 ± 



' 1 

0 

0 ' 


' 0 

1 

2 - 

5 

E\2 

0 

1 

2 

5 

= 

1 

0 

0 


0 

0 

1 

5 - 


0 

0 

1 

5 - 


REMARK 2.5.6 Recall that A and B are row-equivalent if B is obtained 
from A by a sequence of elementary row operations. If E ±, . . . , E r are the 
respective corresponding elementary row matrices, then 


B = E r (... (E 2 (E 1 A)) ...) = {E r ... E\) A = PA, 

where P = E r . . . E\ is non-singular. Conversely if B = PA, where P is 
non-singular, then A is row-equivalent to B. For as we shall now see, P is 
in fact a product of elementary row matrices. 


THEOREM 2.5.8 Let A be non-singular n x n matrix. Then 

(i) A is row-equivalent to I n , 

(ii) A is a product of elementary row matrices. 

Proof. Assume that A is non-singular and let B be the reduced row-echelon 
form of A. Then B has no zero rows, for otherwise the equation AX = 0 
would have a non-trivial solution. Consequently B = I n . 

It follows that there exist elementary row matrices E \ , . . . , E r such that 
E r (. . . {E\A ) . . .) = B = I n and hence A = E^[ 1 ...E~ 1 , a product of 
elementary row matrices. 

THEOREM 2.5.9 Let Abe n x n and suppose that A is row-equivalent 
to I n . Then A is non-singular and A -1 can be found by performing the 
same sequence of elementary row operations on I n as were used to convert 
A to I n . 

Proof. Suppose that E r . . . E\A = I n . In other words BA = I n , where 
B = E r ... E\ is non-singular. Then B~ L (BA) = and so A = L> -1 , 

which is non-singular. 

Also A^ 1 = 1 = B = E r ((. . . ( E\I n ) . . .), which shows that A -1 

is obtained from I n by performing the same sequence of elementary row 
operations as were used to convert A to I n . 
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REMARK 2.5.7 It follows from theorem 2.5.9 that if A is singular, then 
A is row-equivalent to a matrix whose last row is zero. 


EXAMPLE 2.5.9 Show that A 


1 2 
1 1 


is non-singular, find A 1 and 


express A as a product of elementary row matrices. 


Solution. We form the partitioned matrix [A|/ 2 ] which consists of A followed 
by I 2 . Then any sequence of elementary row operations which reduces A to 
I 2 will reduce I 2 to A -1 . Here 


[A\h] = 

R-2 — R\ 

(-1)^2 


1 2 

1 1 

1 2 

0 -1 


1 0 

0 1 

1 0 

-1 1 


R 2 
R 2 

R\ — > R\ — 2 R 2 
Hence A is row-equivalent to I 2 and A is non-singular. Also 


' 1 

2 

1 

1 

O 

0 

1 

1 

-lj 


' 1 

0 

-1 

to 

1 

0 

1 

1 

-ij 


A’ 1 


-1 2 
1 -1 


We also observe that 


1 2 ( 2 ) E2 ( 1 ) E2 1 ( 1 ) A — I2. 


Hence 


A’ 1 = £ 12 (— 2)£ 2 (— 1)£ 21 (— 1) 

A = E 2 i(1)E 2 (— 1)£i 2 (2). 

The next result is the converse of Theorem 2.5.6 and is useful for proving 
the non-singularity of certain types of matrices. 

THEOREM 2.5.10 Let A be an n x n matrix with the property that 
the homogeneous system AX = 0 has only the trivial solution. Then A is 
non-singular. Equivalently, if A is singular, then the homogeneous system 
AX = 0 has a non-trivial solution. 
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Proof. If A is n x n and the homogeneous system AX = 0 has only the 
trivial solution, then it follows that the reduced row-echelon form B of A 
cannot have zero rows and must therefore be I n . Hence A is non-singular. 

COROLLARY 2.5.3 Suppose that A and B are n x n and AB = I n . 
Then BA = I n . 

Proof. Let AB = I n , where A and B are n x n. We first show that B 
is non-singular. Assume BX = 0. Then A(BX) = AO = 0, so ( AB)X = 
0, I n X = 0 and hence X = 0. 

Then from AB = I n we deduce ( AB)B ~ 1 = I n B~ l and hence A = B~ l . 
The equation BB~ 1 = I n then gives BA = I n . 

Before we give the next example of the above criterion for non-singularity, 
we introduce an important matrix operation. 

DEFINITION 2.5.3 (The transpose of a matrix) Let A be an m x n 

matrix. Then A*, the transpose of A, is the matrix obtained by interchanging 
the rows and columns of A. In other words if A = [ciy], then (A*) .. = a. t j. 
Consequently A f is n x m. 

The transpose operation has the following properties: 

1. {A t ) t = A- 

2. (A ± BY = A 1 ± B t if A and B are m x n; 

3. (. sAY = sA t if s is a scalar; 

4. ( AB y = B t A t if A is m x n and B is n x p; 

5. If A is non-singular, then A* is also non-singular and 

(aTMa- 1 )*; 

6. X f X = x\ + . . . + if X = [aq, . . . , a: n ] t is a column vector. 

We prove only the fourth property. First check that both ( AB )* and B t A t 
have the same size (p x m). Moreover, corresponding elements of both 
matrices are equal. For if A = [a^] and B = [ bjk ], we have 


((AB)‘) tl = (AB) ik 
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= 

There are two important classes of matrices that can be defined concisely 
in terms of the transpose operation. 

DEFINITION 2.5.4 (Symmetric matrix) A real matrix A is called sym- 
metric if A* = A. In other words A is square (n x n say) and aji = a^ for 
all 1 < i < n, 1 < j < n. Hence 


is a general 2x2 symmetric matrix. 

DEFINITION 2.5.5 (Skew— symmetric matrix) A real matrix A is called 
skew-symmetric if A* = —A. In other words A is square (n x n say) and 
aji = —aij for all 1 < i < n, 1 < j < n. 

REMARK 2.5.8 Taking i = j in the definition of skew-symmetric matrix 
gives an = —an and so an = 0. Hence 

. I" 0 b 
A = -b 0 


is a general 2x2 skew-symmetric matrix. 


We can now state a second application of the above criterion for non- 
singularity. 

COROLLARY 2.5.4 Let B be an n x n skew-symmetric matrix. Then 
A = I n — B is non-singular. 

Proof. Let A = I n — B, where B l = —B. By Theorem 2.5.10 it suffices to 
show that AX = 0 implies X = 0. 

We have (I n - B)X = 0, so X = BX. Hence X f X = X t BX. 

Taking transposes of both sides gives 

(X^BXY = (X t X) t 
X t B t (X t ) t = X t (X t ) t 
X\-B)X = X l X 
-X t BX = X l X = X t BX. 


Hence X*X = —X t X and X t X = 0. But if A = [aq, . . . , x n ]\ then X l X = 
x\ + . . . + X’); = 0 and hence x\ = 0, . . . , x n = 0. 
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2.6 Least squares solution of equations 

Suppose AX = B represents a system of linear equations with real coeffi- 
cients which may be inconsistent, because of the possibility of experimental 
errors in determining A or B. For example, the system 

x = 1 

y = 2 

x + y = 3.001 


is inconsistent. 

It can be proved that the associated system A t AX = A f B is always 
consistent and that any solution of this system minimizes the sum r\ + . . . + 
r, 2 n , where n, . . . , r m (the residuals) are defined by 


Ti — CLi “b . . . T ai n X n bi , 


for i = 1 , ... ,m. The equations represented by A 1 AX = A f B are called the 
normal equations corresponding to the system AX = B and any solution 
of the system of normal equations is called a least squares solution of the 
original system. 


EXAMPLE 2.6.1 Find a least squares solution of the above inconsistent 
system. 



'10' 


r i 


1 

Solution. Here A = 

0 1 

1 1 

,X = 

X 

. y _ 

,B = 

2 

3.001 


Then A 1 A = 

Also A t B = 


1 0 1 
0 1 1 

1 0 1 
0 1 1 


1 0 
0 1 
1 1 
1 
2 

3.001 


So the normal equations are 


2 1 
1 2 

4.001 

5.001 


2 x + y = 4.001 
x + 2 y = 5.001 


which have the unique solution 

3.001 6.001 

, y = 


X = 


3 


3 
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EXAMPLE 2.6.2 Points (aq, yi),...,(x n , y n ) are experimentally deter- 
mined and should lie on a line y = mx + c. Find a least squares solution to 
the problem. 

Solution. The points have to satisfy 

mx i + c = y\ 


mx n + c = y n , 


or Ax = B, where 


A = 

X\ 1 

, x = 

m 

,B = 

yi 


x n 1 


c 


_ Vn _ 


The normal equations are given by (A t A)X = A t B. Here 




X\ 1 




X\ . 

. . x n 




x\ + .. 

■■ + x„ Xl + . . . + x n 

1 . 

.. 1 


_X n 1 _ 


Xl + .. 

■ • + X n n 







Also 





yi 




Xl . 

. . x n 




xi yi + ■ ■ 

. + x n y n 

1 . 

.. 1 


_ yn _ 


yi + • 

• • + Vn 


It is not difficult to prove that 

A = det (A t A) = ^ ( Xi — Xj ) 2 , 

l<i<j<n 

which is positive unless x\ = ... = x n . Hence if not all of x±, . . . , x n are 
equal, A t A is non-singular and the normal equations have a unique solution. 
This can be shown to be 

m = ^ ^2 (Xi- Xj)(yi - yj ),c= ^ ^ (■ x^j - xjy^Xi - Xj). 


REMARK 2.6.1 The matrix A f A is symmetric. 
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2.7 PROBLEMS 

r i 4 i 

1. Let A = . Prove that A is non-singular, find A 1 and 

o JL 

express A as a product of elementary row matrices. 

r — — — ' 

[Answer: A = 3 s \ 3 , 

L 13 13 - 

A = E- 2 i(— 3)£ , 2(13)£'i2(4) is one such decomposition.] 

2. A square matrix D = [ dij ] is called diagonal if dij = 0 for i / j. (That 
is the off-diagonal elements are zero.) Prove that pre-multiplication 
of a matrix A by a diagonal matrix D results in matrix DA whose 
rows are the rows of A multiplied by the respective diagonal elements 
of D. State and prove a similar result for post-multiplication by a 
diagonal matrix. 

Let diag(ai, . . . ,a n ) denote the diagonal matrix whose diagonal ele- 
ments da are ai, ... ,a n , respectively. Show that 

diag(ai, . . . , a n )diag (&i, ...,b n ) = diag(ai&i, . . . ,a n b n ) 

and deduce that if a\ . . . a n 7^ 0, then diag (ai, . . . , a n ) is non-singular 
and 

(diag (01 , . . . ,a n )) _1 = diag (a] -1 , . . . jCi" 1 ). 

Also prove that diag (ai, . . . , a n ) is singular if ai = 0 for some i. 

' 0 0 2 ' 

3. Let A = 12 6. Prove that A is non-singular, find A -1 and 

_ 3 7 9 _ 

express A as a product of elementary row matrices. 

' -12 7 -2 ' 

[Answers: A -1 = § —3 1 , 

. 5 0 0 _ 

A = Ei2Ll3i(3) J S23^3(2) J Si2(2)^i3(24) J L23(-9) is one such decompo- 
sition.] 
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4. Find the rational number k for which the matrix A = 
is singular. [Answer: k = —3.] 


1 2 k 
3 -1 1 
5 3-5 


5. Prove that A = 


1 2 
-2 -4 

P such that PA has last row zero. 


is singular and find a non-singular matrix 


6. If A = 


1 4 
-3 1 
A ~ 1 = -±(A-2I 2 ). 


, verify that A 2 — 2 A + 13I 2 = 0 and deduce that 


7. Let A 


1 1 -1 

0 0 1 

2 1 2 


(i) Verify that A 3 = 3 A 2 — 3 A + I 3 . 

(ii) Express A 4 in terms of A 2 , A and I 3 and hence calculate A 4 
explicitly. 

(iii) Use (i) to prove that A is non-singular and find A -1 explicitly. 


[Answers: (ii) A 4 = 6A 2 — 8A + 3/3 


-11 -8 -4 
12 9 4 

20 16 5 


(iii) A 1 = A 2 — 3A + 3/3 


-1 -3 1 

2 4 -1 .] 

0 1 0 


8. (i) Let B be an n x n matrix such that B 3 = 0. If A = I n — B, prove 

that A is non-singular and A -1 = I n + B + B 2 . 

Show that the system of linear equations AX = b has the solution 


V = b + Bb + B 2 b. 


(ii) If B = 


0 r s 

0 0 t 
0 0 0 _ 
(I 3 — B )- 1 explicit 


verify that B 3 


0 and use (i) to determine 
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[Answer: 


1 r s + rt 

0 1 t .} 

0 0 1 


9. Let A be n x n. 

(i) If A 2 = 0, prove that A is singular. 

(ii) If A 2 = A and A ^ I n , prove that A is singular. 

10. Use Question 7 to solve the system of equations 

x + y — z = a 
z = b 
2x + y + 2z = c 

where a, b, c are given rationals. Check your answer using the Gauss- 
Jordan algorithm. 

[Answer: x = —a — 36 + c, y = 2a + 46 — c, z = b .] 

11. Determine explicitly the following products of 3 x 3 elementary row 
matrices. 


(i) E12E23 (ii) Ei( 5)E\2 (iii) £'i2(3)£ , 2i(— 3) (iv) (£i(100)) 1 
(v) (vi) {E l2 {7))- 1 (vii) (E 12 (7)E 31 (1))- 1 . 



' 0 

0 

1 ' 


' 0 

5 

0 ' 


' -8 

3 

0 ' 

[Answers: (i) 

1 

0 

0 

(ii) 

1 

0 

0 

(iii) 

-3 

1 

0 


0 

1 

0 


0 

0 

1 


0 

0 

1 


- 1 

100 

0 

0 ' 


' 0 

1 

0 ' 


' 1 

-7 

0 ' 


0 

1 

0 

(v) 

1 

0 

0 

(vi) 

0 

1 

0 

(vii) 

0 

0 

1 


0 

0 

1 


0 

0 

1 



12. Let A be the following product of 4 x 4 elementary row matrices: 

A = E 3 (2)E u E 42 (3). 

Find A and A~ 1 explicitly. 



' 0 

3 

0 

1 ' 


' 0 

0 

0 

1 ' 

[Answers: A = 

0 

1 

0 

0 

, A~* = 

0 

1 

0 

1 

2 

0 

0 

0 

2 

0 

0 

0 

0 


1 

0 

0 

0 


1 

-3 

0 

0 



52 


CHAPTER 2. MATRICES 


13. Determine which of the following matrices over Z 2 are non-singular 
and find the inverse, where possible. 


' 1 

1 

0 1 ' 


' 1 

1 

0 

1 ' 

0 

0 

1 1 

(b) 

0 

1 

1 

1 

1 

1 

1 1 

1 

0 

1 

0 

_ 1 

0 

0 1 _ 


_ 1 

1 

0 

1 _ 


1 1 1 
0 0 1 
0 1 0 
1 1 0 


[Answer: (a) 


14. Determine which of the following matrices are non-singular and find 
the inverse, where possible. 


(a) 

'111' 
-1 1 0 

(b) 

' 2 2 4 ' 
1 0 1 

(c) 

'4 6 -3 ' 
0 0 7 


to 

O 

O 


0 1 0 


0 0 5 





' 1 2 4 6 ' 



(d) 

' 2 0 0 ' 

(e) 

0 12 0 

(f) 

'12 3' 

0-5 0 

0 0 7 

0 0 12 

4 5 6 

5 7 9 




_ 0 0 0 2 _ 





' 0 

0 

1 - 

2 


1 

2 

2 

1 ' 


- 1 

2 

0 

0 ' 

[Answers: (a) 

0 

1 

1 

2 

(b) 

0 

0 

1 

(d) 

0 

1 

5 

0 


1 

-1 

-1 _ 


1 

2 

-1 

-1 


_ 0 

0 

1 

7 - 


1-2 0-3 


L 0 0 0 l J 

15. Let A be a non-singular n x n matrix. Prove that A * is non-singular 
and that (A 4 ) -1 = (A _1 )L 


16. Prove that A 


a b 
c d 


has no inverse if ad — be = 0. 


[Hint: Use the equation A 2 — (a + d)A + (ad — bc)I -2 = 0.] 
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17. Prove that the real matrix A = 


1 a b 
—a 1 c 
—b —c 1 
proving that A is row-equivalent to I 3 . 


is non-singular by 


18. If P l AP = B, prove that P L A n P = B n for n > 1 


. Verify that P l AP = 



- 2 

1 - 


1 3 ’ 

19. Let A = 

! 

. 3 

4 

3 

4 _ 

,p = 

-1 4 


and deduce that 


A 0 
12 u 

0 1 


1 

' 3 

3 ' 

1 , 

4 5 y 

4 

-3 ' 

7 

4 

4 

+ 7^ 

K n) 

-4 

3 


20. Let A = 


a b 
c d 


be a Markov matrix; that is a matrix whose elements 


are non-negative and satisfy a+c = 1 = b+d. Also let P = 
Prove that if A 7 ^ I 2 then 


b 1 
c —1 


(i) P is non-singular and P 1 AP = 
1 


(ii) A n 


b + c 


b b 

c c 


as n — ► 00 , if 4/ 


1 0 
0 a + d — 1 

0 1 
1 0 


21. If A = 

1 

CO I — 1 

^ to 

1 

and Y = 

' -1 ' 
3 


_ 5 6 _ 


4 


find XX\ X f X, YY\ Y l Y . 


[Answers: 


5 11 17 
11 25 39 
17 39 61 


35 44 
44 56 ’ 


1 

-3 

-4 

3 

9 

12 

4 

12 

16 


26.] 


22. Prove that the system of linear equations 

x + 2 y = 4 

x + y = 5 

3x + 5 y = 12 

is inconsistent and find a least squares solution of the system. 
[Answer: x = 6, y = —7/6.] 
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23. The points (0, 0), (1, 0), (2, —1), (3, 4), (4, 8) are required to lie on a 
parabola y = a + bx + cx 2 . Find a least squares solution for a, b, c. 
Also prove that no parabola passes through these points. 

[Answer: a = g, b = —2, c = 1.] 

24. If A is a symmetric nxn real matrix and B is n x m, prove that B f AB 
is a symmetric m x m matrix. 

25. If A is m x n and B is n x rn. prove that AB is singular if m > n. 

26. Let A and B be n x n. If A or B is singular, prove that AB is also 
singular. 



Chapter 3 


SUBSPACES 


3.1 Introduction 

Throughout this chapter, we will be studying F n , the set of all n-dimensional 
column vectors with components from a field F. We continue our study of 
matrices by considering an important class of subsets of F n called subspaces. 
These arise naturally for example, when we solve a system of m linear ho- 
mogeneous equations in n unknowns. 

We also study the concept of linear dependence of a family of vectors. 
This was introduced briefly in Chapter 2, Remark 2.5.4. Other topics dis- 
cussed are the row space, column space and null space of a matrix over F. 
the dimension of a subspace, particular examples of the latter being the rank 
and nullity of a matrix. 

3.2 Subspaces of F n 

DEFINITION 3.2.1 A subset S of F n is called a subspace of F n if 

1. The zero vector belongs to 5; (that is, 0 £ S); 

2. If u E S and v £ S, then u + v £ S; (S is said to be closed under 
vector addition); 

3. If u £ S and t £ F, then f it £ 5; ( S is said to be closed under scalar 
multiplication) . 

EXAMPLE 3.2.1 Let A £ M mxn (F). Then the set of vectors X £ F n 
satisfying AX = 0 is a subspace of F n called the null space of A and is 
denoted here by N(A). (It is sometimes called the solution space of A.) 
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Proof. (1) AO = 0, so 0 G N{A); (2) If X. Y £ N(A ), then AX = 0 and 
AY = 0, so A(X + y) = AX + AY = 0 + 0 = 0 and so I + 7 G X(A); (3) 
If X G N(A) and t G F, then A(tX) = t(AX) = tO = 0, so tX G N(A). 

For example, if A = J 

just the zero vector. If A = 
multiples of [—2, 1]*. 

EXAMPLE 3.2.2 Let X\, . . . , X m G F n . Then the set consisting of all 
linear combinations x\X\ + • • • + x rn X rn . where xi, . . . , x m € F, is a sub- 
space of F n . This subspace is called the subspace spanned or generated by 
X\, , X m and is denoted here by (Xi, . . . , X m ). We also call X\, . . . , X m 
a spanning family for S = (Xi, . . . , X m ). 

Proof. (1) 0 = OXi + • • • + 0X m , so 0 G (X h . . .,X m ); (2) If X, Y G 
(X i, . . . , X m ), then X — x\Xi T ■ ■ ■ T x m X m and 1 — y\X\ T • • • T y m X m , 
so 


then N(A) = {0}, the set consisting of 
2 1 

, then N{A) is the set of all scalar 


X + Y — (xiXi + • • • + x m X m ) + (yiXi + • • • + ymX m ) 

= (%i + y\)X\ h h( %m T ym)X m G (Xi, . . . , X m ) . 

(3) If X G (Xi, . . . , X m ) and t G F, then 

X = xiXi T • • • T x m X m 
tX — t(x\X\ T • • • T x m X m ) 

= (tx i)Xi + • • • + (tx m )X m G (Xi, . . . , X m ). 


For example, if A G M mxn (F), the subspace generated by the columns of A 
is an important subspace of F m and is called the column space of A. The 
column space of A is denoted here by C(A). Also the subspace generated 
by the rows of A is a subspace of F n and is called the row space of A and is 
denoted by R(A). 

EXAMPLE 3.2.3 For example F n = (E\, . . . , E n ), where . . . , E n are 
the n-dinrensional unit vectors. For if X = [xi, . . . , x n ]* G F n , then X = 
x\ E\ + • • • + x n E n . 


EXAMPLE 3.2.4 Find a spanning family for the subspace S of M 3 defined 
by the equation 2x — 3y + 5z = 0. 
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Solution. ( S is in fact the null space of [2, —3, 5], so S is indeed a subspace 
of M 3 .) 

If [x, y, z]* G S, then x = \y — | z. Then 


X 


■§v-§*' 


r 3 -| 
2 


r 5 i 
2 

y 

z 


y 

z 

= y 

l 

_ o _ 

+ z 

0 

l 


and conversely. Hence [|, 1, 0]* and [— |, 0, 1]* form a spanning family for 
S. 

The following result is easy to prove: 

LEMMA 3.2.1 Suppose each of X\, . . . , X r is a linear combination of 
Yi, . . . , Y s . Then any linear combination of X\ , . . . , X r is a linear combi- 
nation Of Yl ..... Yg. 

As a corollary we have 

THEOREM 3.2.1 Subspaces (X\, . . . ,X r ) and (Yi,...,Y s ) are equal if 
each of X \ , . . . , X r is a linear combination of Y\ , . . . , Y s and each of Y\ , . . . , Y s 
is a linear combination of X \ , . . . , X r . 

COROLLARY 3.2.1 Subspaces (A'i, . . . , X r , Z\, . . . , Z t ) and (X\, . . . , X r ) 
are equal if each of Z\, ... ,Zt is a linear combination of X±, . . . , X r . 

EXAMPLE 3.2.5 If X and Y are vectors in M n , then 

(X, Y) = {X + Y, X — Y). 

Solution. Each of X + Y and X — Y is a linear combination of X and Y. 
Also 

X= 1 -(X + Y) + ^(X-Y) and Y = ±(X + Y) - ±(X - Y), 

so each of X and Y is a linear combination of X + Y and X — Y. 

There is an important application of Theorem 3.2.1 to row equivalent 
matrices (see Definition 1.2.4): 

THEOREM 3.2.2 If A is row equivalent to B, then R(A) = R(B). 

Proof. Suppose that B is obtained from A by a sequence of elementary row 
operations. Then it is easy to see that each row of B is a linear combination 
of the rows of A. But A can be obtained from B by a sequence of elementary 
operations, so each row of A is a linear combination of the rows of B. Hence 
by Theorem 3.2.1, R(A) = R(B). 
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REMARK 3.2.1 If A is row equivalent to B, it is not always true that 
C(A) = C(B). 


1 1 
1 1 

reduced row-echelon form of 


For example, if A = 


A 


and B = 


1 1 
0 0 

However we see that 


then B is in fact the 


C{A) = 


and similarly C(B) = / J 

Consequently C(A) / C(B ), as 


€ C(A) but 


?C(B). 


3.3 Linear dependence 

We now recall the definition of linear dependence and independence of a 
family of vectors in F n given in Chapter 2. 

DEFINITION 3.3.1 Vectors X\, ... , X m in F n are said to be linearly 
dependent if there exist scalars xi, . , , ,x m , not all zero, such that 

x\X\ + • • • + x m X m = 0. 

In other words, X\ , . . . , X m are linearly dependent if some X * is expressible 
as a linear combination of the remaining vectors. 

Xi, , X rn are called linearly independent if they are not linearly depen- 
dent. Hence X \, . . . , X m are linearly independent if and only if the equation 

.r i A | T • • • T x m X m — 0 

has only the trivial solution aq = 0 , . . . , x m = 0. 

EXAMPLE 3.3.1 The following three vectors in M 3 


Xi = 

' 1 ' 
2 

, *2 = 

' -1 ' 
1 

, ^3 = 

' -1 ' 
7 


3 


2 


12 


are linearly dependent, as 2Xi + 3 X 2 + (— 1)X"3 = 0. 
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REMARK 3.3.1 If X \ . . . . , X m are linearly independent and 


x\X\ + • • • + x m X m — y\X\ + • • • + y m X m , 
then x\ = yi, . ... , x m = y m . For the equation can be rewritten as 
(x i y\)X\ + • • • + ( x m ym)X m — 0 

and so x± - yi = 0, . . . , x m - y m = 0. 

THEOREM 3.3.1 A family of m vectors in F n will be linearly dependent 
if m > n. Equivalently, any linearly independent family of m vectors in F n 
must satisfy m < n. 

Proof. The equation 

X\Xi T * * * T XrnXrn — 0 

is equivalent to n homogeneous equations in m unknowns. By Theorem 1.5.1, 
such a system has a non-trivial solution if m > n. 

The following theorem is an important generalization of the last result 
and is left as an exercise for the interested student: 

THEOREM 3.3.2 A family of s vectors in {X\, . . . , X r ) will be linearly 
dependent if s > r. Equivalently, a linearly independent family of s vectors 
in (Xi, . . . , X r ) must have s < r. 


Here is a useful criterion for linear independence which is sometimes 
called the left-to-right test : 

THEOREM 3.3.3 Vectors X \, . . . , X m in F n are linearly independent if 

(a) X x + 0; 

(b) For each k with 1 < k < m, X*, is not a linear combination of 
Xi , . . . , X fc _i. 

One application of this criterion is the following result: 

THEOREM 3.3.4 Every subspace S of F n can be represented in the form 
S = (Xi, . . . , X m ), where m < n. 
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Proof. If S = {0}, there is nothing to prove - we take X\ = 0 and m = 1. 

So we assume S contains a non-zero vector Xi; then (Xi } C S as S is a 
subspace. If S' = (Xi), we are finished. If not, S will contain a vector X2, 
not a linear combination of Xi; then (Xi, X2) C S as S is a subspace. If 
S = (Xi, X2), we are finished. If not, S will contain a vector X3 which is 
not a linear combination of X\ and X2. This process must eventually stop, 
for at stage k we have constructed a family of k linearly independent vectors 
Xi, . . . , Xfc, all lying in F n and hence k < n. 

There is an important relationship between the columns of A and B, if 
A is row-equivalent to B. 

THEOREM 3.3.5 Suppose that A is row equivalent to B and let ci, . . . , c r 
be distinct integers satisfying 1 < Cj < n. Then 

(a) Columns A * Cl , . . . , A* Cr of A are linearly dependent if and only if the 
corresponding columns of B are linearly dependent; indeed more is 
true: 

*£-1^4*01 ’ T A-%c r — 0 < t = /‘ X[B*c 1 T * * * T %rB* Cr — 0. 

(b) Columns A * C1 , . . . , A* Cr of A are linearly independent if and only if the 
corresponding columns of B are linearly independent. 

(c) If 1 < c r+ 1 < n and c r+ \ is distinct from c\ , . . . ,c r , then 

X* Cr _j_i — ziA% Cl T * * • T z r A% Cr ■£=/■ B* Cr+1 — z\B^ Cl T ■ ■ ■ T z r B* Cr . 

Proof. First observe that if Y = [y 1 , . . . , y n ] 1 is an n-dinrensional column 
vector and A is rn x n, then 


— y i.A*i T • • • T ynA*n 


Also AY = 0 BY = 0, if B is row equivalent to A. Then (a) follows by 
taking y* = x Cj if i = cj and y* = 0 otherwise. 

(b) is logically equivalent to (a), while (c) follows from (a) as 


A 


*C r + 1 


z\A* Cl + • • • + z r A * 


z\A* Cl T * * * T z r A* Cr T ( l)A* Cr+1 — 0 
< t = 7 > z\B* Cl + • • • + z r B* Cr T ( 1 ) B* c T — 1 — 0 
‘X?’ B* Cr + 1 — ziR* ci T * * * T ZrB*c r ‘ 
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EXAMPLE 3.3.2 The matrix 

'1 15 1 4 

A = 2-112 2 

3 0 6 0 -3 _ 

has reduced row-echelon form equal to 

" 1 0 2 0 -1 ' 

B= 0130 2 . 

0 0 0 1 3 

We notice that 5*i, 5 * 2 and 5 * 4 are linearly independent and hence so are 
A*i, A * 2 and A* 4 . Also 

5*3 = 25*i + 35*2 

5*5 = (—l)5*i + 25*2 T 35*4, 

so consequently 

A *3 = 2 A*i + 3A*2 

^4*5 = (— l)A*i + 2A*2 + 3A* 4 . 

3.4 Basis of a subspace 

We now come to the important concept of basis of a vector subspace. 

DEFINITION 3.4.1 Vectors X±, . . . , X m belonging to a subspace S are 
said to form a basis of S if 

(a) Every vector in S is a linear combination of X\, . . . , X m ; 

(b) Xi, ... , X m are linearly independent. 

Note that (a) is equivalent to the statement that S = (X 4 , . . . ,X m ) as we 
automatically have {X\. . . . , X m ) C S. Also, in view of Remark 3.3.1 above, 
(a) and (b) are equivalent to the statement that every vector in S is uniquely 
expressible as a linear combination of X \, . . . , X m . 

EXAMPLE 3.4.1 The unit vectors E\. . . . , E n form a basis for F n . 
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REMARK 3.4.1 The subspace {0}, consisting of the zero vector alone, 
does not have a basis. For every vector in a linearly independent family 
must necessarily be non-zero. (For example, if X\ = 0, then we have the 
non-trivial linear relation 

1X\ + OA 2 + • • • + 0X m = 0 

and Xi , . . . , X m would be linearly dependent.) 

However if we exclude this case, every other subspace of F n has a basis: 

THEOREM 3.4.1 A subspace of the form (Xi,...,X m ), where at least 
one of Xi , . . . , X m is non-zero, has a basis X C1 , . . . , X Cr , where 1 < c\ < 
• • • < c r < m. 

Proof. (The left-to-right algorithm ). Let c\ be the least index k for which 
Afc is non-zero. If c\ = rn or if all the vectors X & with k > c\ are linear 
combinations of X C1 , terminate the algorithm and let r = 1. Otherwise let 
C 2 be the least integer k > c\ such that X^ is not a linear combination of 

X C1 . 

If C 2 = m or if all the vectors X & with k > c 2 are linear combinations 
of X C] and X C2 , terminate the algorithm and let r = 2. Eventually the 
algorithm will terminate at the r-th stage, either because c r = m, or because 
all vectors X^ with k > c r are linear combinations of X Cl , , X Cr . 

Then it is clear by the construction of X C1 , . . . , X Cr , using Corollary 3.2.1 
that 

(a) (X cl ,...,X Cr ) = (X 1 ,...,X m )- 

(b) the vectors X C1 , . . . , X Cr are linearly independent by the left-to-right 
test. 

Consequently X C1 , . . . , X Cr form a basis (called the left-to-right basis ) for 
the subspace (Xi, . . . , X m ). 

EXAMPLE 3.4.2 Let X and Y be linearly independent vectors in M n . 
Then the subspace (0, 2X, X, —Y, X + Y) has left-to-right basis consisting 
of 2X, -Y. 

A subspace S will in general have more than one basis. For example, any 
permutation of the vectors in a basis will yield another basis. Given one 
particular basis, one can determine all bases for S using a simple formula. 
This is left as one of the problems at the end of this chapter. 

We settle for the following important fact about bases: 
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THEOREM 3.4.2 Any two bases for a subspace S must contain the same 
number of elements. 

Proof. For if X\, ... ,X r and Y±, ... ,Y S are bases for S, then Y\, . . . , Y s 
form a linearly independent family in S = (A] , . . . , X r ) and hence s < r by 
Theorem 3.3.2. Similarly r < s and hence r = s. 

DEFINITION 3.4.2 This number is called the dimension of S and is 
written dim S'. Naturally we define dim{0} = 0. 

It follows from Theorem 3.3.1 that for any subspace S of F n , we must have 
dim S < n. 

EXAMPLE 3.4.3 If E\, . . . , E n denote the n-dimensional unit vectors in 
F n , then dim (E \, . . . , Ei) = i for 1 < i < n. 


The following result gives a useful way of exhibiting a basis. 

THEOREM 3.4.3 A linearly independent family of rn vectors in a sub- 
space S, with dim S = rn. must be a basis for S. 

Proof. Let Xi, . . . ,X m be a linearly independent family of vectors in a 
subspace S, where dim S = m. We have to show that every vector X G S is 
expressible as a linear combination of A] , ... , A m . We consider the following 
family of vectors in S: Ai, . . . , X m , A. This family contains m + 1 elements 
and is consequently linearly dependent by Theorem 3.3.2. Hence we have 

x\X\ T ■ ■ • T x m X m T x m+ iX — 0, (3-1) 

where not all of x\ , . . . ,x m+ i are zero. Now if x m+ i = 0, we would have 

aqAi A ■ ■ ■ x m Xm, — 0? 

with not all of x±, . . . , x m zero, contradictiong the assumption that X\ ... , X m 
are linearly independent. Hence x m+ \ / 0 and we can use equation 3.1 to 
express A as a linear combination of A] X m : 

A=^A 1 + ... + ^A m . 

^ m +1 % m -\- 1 
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3.5 Rank and nullity of a matrix 

We can now define three important integers associated with a matrix. 
DEFINITION 3.5.1 Let A e M mxn (F). Then 

(a) column rank A = dim C { A ) ; 

(b) row rank A = dim R(A ) ; 

(c) nullity A = dim N(A). 

We will now see that the reduced row-echelon form B of a matrix A allows 
us to exhibit bases for the row space, column space and null space of A. 
Moreover, an examination of the number of elements in each of these bases 
will immediately result in the following theorem: 

THEOREM 3.5.1 Let A € M mxn (F). Then 

(a) column rank A = row rank A ; 

(b) column rank A+ nullity A = n. 

Finding a basis for R(A): The r non-zero rows of B form a basis for R(A) 
and hence row rank A = r. 

For we have seen earlier that R(A) = R(B). Also 

R{B) = (Bu,..,,B m *) 

= (Bu , . . . , B r *, 0 . . . , 0) 

— (-^*1* ; • • • i B r *) . 

The linear independence of the non-zero rows of B is proved as follows: Let 
the leading entries of rows 1, . . . , r of B occur in columns c\, ... ,c r . Suppose 
that 

xiBu + b x r B„ = 0. 

Then equating components ci, . . . , c r of both sides of the last equation, gives 
xi = 0, . . . , x r = 0, in view of the fact that B is in reduced row- echelon 
form. 

Finding a basis for C[A) \ The r columns A * Cl , . . . , A* Cr form a basis for 
C(A) and hence column rank A = r. For it is clear that columns ci, . . . , c r 
of B form the left-to-right basis for C(B) and consequently from parts (b) 
and (c) of Theorem 3 . 3 . 5 , it follows that columns ci, . . . , c r of A form the 
left-to-right basis for C(A). 
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Finding a basis for N(A): For notational simplicity, let us suppose that c\ = 
1 ,Cr = r. Then B has the form 


' 1 

0 

... 0 

blr+1 

' ' ' bln 

0 

1 

... 0 

&2r+l 

' ' ' b2n 

0 

0 

... 1 

brr+1 

brn 

0 

0 

... 0 

0 

... 0 

_ 0 

0 

... 0 

0 

... 0 


Then N(B) and hence N(A) are determined by the equations 
Xl — ( ^Ir+l^r+l T ' ' ' T ( b\ n)X n 


X r — ( &rr+ l)®r+l T ' ' ' T ( b rn )x n , 

where xy+ 1 , ... ,x n are arbitrary elements of F. Hence the general vector X 
in iV (A) is given by 



— x r+ 1 A l + • • • + X n X n — r . 


Hence N(A) is spanned by X\, . . . , X n _ r , as x r+ \, . . . , x n are arbitrary. Also 
Xi, . . . , X n - r are linearly independent. For equating the right hand side of 
equation 3.2 to 0 and then equating components r + 1, . . . , n of both sides 
of the resulting equation, gives x r +i = 0, . . . , x n = 0. 

Consequently X\, ... , X n _ r form a basis for N(A). 

Theorem 3.5.1 now follows. For we have 

row rank A = dim R(A) = r 
column rank A = dim C( A) = r. 

Hence 

row rank A = column rank A. 
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Also 

column rank A + nullity A = r + dim N ( A ) = r + (n — r) = n. 

DEFINITION 3.5.2 The common value of column rank A and row rank A 
is called the rank of A and is denoted by rank A. 


EXAMPLE 3.5.1 Given that the reduced row-echelon form of 

'1 15 1 4 ' 

A = 2-112 2 

3 060-3 



Solution. [1, 0, 2, 0, —1], [0, 1, 3, 0, 2] and [0, 0, 0, 1, 3] form a basis for 
I?(A). Also 



form a basis for C{A). 

Finally N(A) is given by 


x\ 

X2 

X3 

X4 

X 5 


-2x 3 -hx 5 ' -2 1 

—3x3 — 2 .T 5 —3 —2 

x 3 = x 3 1 +x 5 0 =x 3 Xi + x 5 X 2 , 

—3x5 0 —3 

x 5 . 0 _ 1 


where x 3 and X5 are arbitrary. Hence X\ and X 2 form a basis for N(A). 
Here rank A = 3 and nullity A = 2. 


EXAMPLE 3.5.2 Let A = ^ ^ Then B = ^ is the reduced 

2 4 J [ 0 0 

row-echelon form of A. 
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Hence [1, 2] is a basis for R(A) and ^ is a basis for C(A). Also iV(A) 
is given by the equation x± = —2x2, where X 2 is arbitrary. Then 


x\ 

X2 


-2x 2 

X2 


= x 2 


— 2 
1 


and hence “ 
Here rank A = 


is a basis for iV(A). 
1 and nullity A = 1. 


EXAMPLE 3.5.3 Let A = 

row-echelon form of A. 

Hence [1, 0], [0, 1] form a basis for R(A) while [1, 3], [2, 4] form a basis 
for C(A). Also N(A) = {0}. 

Here rank A = 2 and nullity A = 0. 


Then B = 


1 0 
0 1 


is the reduced 


We conclude this introduction to vector spaces with a result of great 
theoretical importance. 

THEOREM 3.5.2 Every linearly independent family of vectors in a sub- 
space S can be extended to a basis of S. 

Proof. Suppose S has basis A] .... , X m and that Y\, . . . ,Y r is a linearly 
independent family of vectors in S. Then 

S = (Xi, . . . , X m ) = (Y 1 ,...,Y r ,X 1 ,...,X m ), 

as each of Y\, ... ,Y r is a linear combination of X \, . . . , X m . 

Then applying the left-to-right algorithm to the second spanning family 
for S will yield a basis for S which includes Y±, . . . ,Y r . 


3.6 PROBLEMS 

1. Which of the following subsets of M 2 are subspaces? 

(a) [x, y] satisfying x = 2 y\ 

(b) [x, y\ satisfying x = 2 y and 2x = y, 

(c) [x, y\ satisfying x = 2y + 1; 

(d) [x, y\ satisfying xy = 0; 
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(e) [x, y\ satisfying x > 0 and y > 0 . 
[Answer: (a) and (b).] 

2. If X, Y, Z are vectors in M n , prove that 




3. Determine if X\ = 

' 1 ' 
0 

1 

, *2 = 

' 0 ' 
1 

1 

and A 3 = 

' 1 ' 
1 

1 

are linearly 

independent in M 4 . 

_ 2 _ 


_ 2 _ 


3 



4. For which real numbers A are the following vectors linearly independent 
in M 3 ? 


1 

1 


1 

T— 1 

1 

1 


1 — 

T— 1 

1 

1 

-1 

II 

(N 

A 

II 

CO 

* 

-1 

1 

T— 1 

1 

1 


1 

l 

1 — 1 

1 


1 

V 

1 


5. Find bases for the row, column and null spaces of the following matrix 
over Q: 

'll 201' 

2 2 5 0 3 

0 0 0 1 3 ' 

_ 8 11 19 0 11 _ 

6 . Find bases for the row, column and null spaces of the following matrix 
over Z 2 : 

'10 10 1 ' 

01011 
11110 ‘ 

0 0 110 


7. Find bases for the row, column and null spaces of the following matrix 
over Z 5 : 

'112 0 13 ' 

2 1 4 0 3 2 

0 0 0 1 3 0 ‘ 

3 0 2 4 3 2 
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8. Find bases for the row, column and null spaces of the matrix A defined 
in section 1.6, Problem 17. (Note: In this question, F is a field of four 
elements.) 

9. If X i , ... , A m form a basis for a subspace S, prove that 


Xi, X 1 + X 2 ,...,X 1 + --- + X m 


also form a basis for S. 


10. Let A = 

1; (b) rank A 


b c 
1 1 
= 2 . 


Find conditions on a, b, c such that (a) rank A = 


[Answer: (a) a = b = c; (b) at least two of a, b, c are distinct.] 

11. Let S be a subspace of F n with dim S = m. If Ad , ... , X m are vectors 
in S with the property that S = (Ai, . . . , X m ), prove that Xi ... , X m 
form a basis for S. 

12. Find a basis for the subspace S of M 3 defined by the equation 


x + 2y + 3z = 0. 

Verify that Y\ = [—1, —1, 1] * 65 and find a basis for S which includes 
Vi. 

13. Let X\, . . . , X m be vectors in F n . If Xi = Xj, where i < j, prove that 
Xi, . . . X rn are linearly dependent. 

14. Let Xi , . . . , X m+ \ be vectors in F n . Prove that 

dim (Ai, . . . , X m+ i) = dim (Ai, . . . , X m ) 
if A m _|_i is a linear combination of Ai, . . . , A m , but 

dim (Ai, . . . , A m+ i) = dim (Ai, . . . , X m ) + 1 

if A m _|_i is not a linear combination of Ai, . . . , X m . 

Deduce that the system of linear equations AX = B is consistent, if 
and only if 


rank[A|S] = rank A. 
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15. Let ai,...,a n be elements of F, not all zero. Prove that the set of 
vectors [xi, . . . , x n Y where x\, . . . , x n satisfy 

(i]Xi H 1- a n x n = 0 

is a subspace of F n with dimension equal to n — 1. 

16. Prove Lemma 3.2.1, Theorem 3.2.1, Corollary 3.2.1 and Theorem 3.3.2. 

17. Let R and S be subspaces of F n , with R C S. Prove that 

dim R < dim S 

and that equality implies R = S. (This is a very useful way of proving 
equality of subspaces.) 

18. Let R and S be subspaces of F n . If R U S is a subspace of F n , prove 
that R C S or S C R. 

19. Let X \, . . . , X r be a basis for a subspace S. Prove that all bases for S 
are given by the family Yf, . . . , Y r , where 

r 

Yi = ciijXj, 

3 = 1 

and where A = [a^] € M rxr (F) is a non-singular matrix. 



Chapter 4 

DETERMINANTS 


DEFINITION 4.0.1 If A = 


On «12 

0-21 a 22 

A. (also denoted by det A,) to be the scalar 


we define the determinant of 


det A — 011O22 — 0120.21- 


The notation 


On Oi2 

«21 022 


is also used for the determinant of A. 


If A is a real matrix, there is a geometrical interpretation of det A. If 
P = (x\, 2/1 ) and Q = (X2, 2/2) are points in the plane, forming a triangle 

xi Vi 
X 2 V 2 

of the triangle OPQ. For, using polar coordinates, 


with the origin O = ( 0 , 0 ), then apart from sign, 


is the area 
et x\ = r\ cos 9\ and 


2/i = r 1 sin 0 i, where r± = OP and 9 1 is the angle made by the ray OP with 
the positive x-axis. Then triangle OPQ has area \OP ■ OQ sin a, where 
a = ZPOQ. If triangle OPQ has anti-clockwise orientation, then the ray 


OQ makes angle $2 = 9\ + a with the positive x-axis. (See Figure 4 . 1 .) 


Also X2 = r'2 cos 02 and 2/2 = ^2 sin 02 . Hence 


Area OPQ 


-QP • OQ sin a 
^ OP • OQsin ( 0 2 - 0 i ) 

-QP • OQ(sin 02 cos 0 i — cos 02 sin 0 i) 

- (OQ sin 02 • OP cos 0 i — OQ cos 02 • OP sin 0 i) 
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V 



Figure 4.1: Area of triangle OPQ. 


1 

2 

1 

2 


(2/2^1 - X 2 2/l) 
2/1 

X 2 2/2 


Similarly, if triangle OPQ has clockwise orientation, then its area equals 

_i 2/1 

2 X 2 2/2 ' 

For a general triangle PiP 2 P 3 , with P, = (x*, yi ), i = 1, 2, 3, we can 
take Pi as the origin. Then the above formula gives 


x 2 - X\ 

2/2 - 2/1 

1 

or 

X'2 - Xl 

2/2 - 2/1 

x 3 - Xl 

2/3 - 2/1 

2 

1 

00 

H 

2/3 - 2/1 


according as vertices P 1 P 2 P 3 are anti-clockwise or clockwise oriented. 

We now give a recursive definition of the determinant of an n x n matrix 
A = [dij\, n > 3. 


DEFINITION 4.0.2 (Minor) Let Mij(A) (or simply M tJ if there is no 
ambiguity) denote the determinant of the (n — 1) x (n — 1) submatrix of A 
formed by deleting the i-th row and )-th column of A. (A fij(A) is called 
the ( i , j ) minor of A.) 


Assume that the determinant function has been defined for matrices of 
size (n — 1) x (n — 1). Then det A is defined by the so-called first-row Laplace 
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expansion : 

det A = aiiMn(A) — ai 2 -Mi 2 (A) + . . . + (— l) 1+n Mi n (A) 

n 

3 = 1 

For example, if A = is a 3 x 3 matrix, the Laplace expansion gives 
det A = aj i M\ \ (A) — a 1 2 44 ) 2 ( A) + 013.^13(^4) 


«22 

<323 

<321 

— <312 

<323 

. <321 

+ <313 

<322 

«32 

<333 

<331 

<333 

<331 

<332 


= <211(022033 — 023032) - 012(021033 - 023031) + 013(021032 — 022031) 

= 011022033 — 011023032 — 012021033 + 012023031 + 013021032 — 013022031. 

(The recursive definition also works for 2x2 determinants, if we define the 
determinant of a 1 x 1 matrix [t] to be the scalar t: 

det A = aiiMn(A) - a^M^A) = ona 2 2 - 012021 -) 


EXAMPLE 4.0.1 If P 1 P 2 P 3 is a triangle with P t = (x*, y*), * = 1, 2, 3, 
then the area of triangle P 1 P 2 P 3 is 


X\ 

yi 

1 

X'l 

2/1 

1 

X2 

2/2 

1 or 

-2 12 

2/2 

1 

X3 

2/3 

1 

X3 

2/3 

1 


according as the orientation of P 1 P 2 P 3 is anti-clockwise or clockwise. 
For from the definition of 3 x 3 determinants, we have 


Xi 

2/1 

1 

X2 

2/2 

1 

X3 

2/3 

1 


1 

( 2/2 

1 


X 2 

1 

x 2 

2/2 

— 

Xi 

1 

- 2/1 


1 

+ 

2 

V 2/3 

X 3 

X3 

2/3 

1 

X'2 - Xi 

2/2 

- 2/1 





2 

1 

CO 

H 

2/3 

- 2/1 






One property of determinants that follows immediately from the defini- 
tion is the following: 

THEOREM 4.0.1 If a row of a matrix is zero, then the value of the de- 
terminant is zero. 
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(The corresponding result for columns also holds, but here a simple proof 
by induction is needed.) 

One of the simplest determinants to evaluate is that of a lower triangular 
matrix. 

THEOREM 4.0.2 Let A = [a t j\- where a t j = 0 if i < j. Then 


det A = ana 2 2 • • -a nn . (4.1) 

An important special case is when A is a diagonal matrix. 

If A =diag (ai, . . . , a n ) then det A = a\ . . . a n . In particular, for a scalar 
matrix tl n , we have det (tl n ) = t n . 

Proof. Use induction on the size n of the matrix. 

The result is true for n = 2. Now let n > 2 and assume the result true 
for matrices of size n — 1. If A is n x n, then expanding det A along row 1 
gives 


det A 



022 

0 ., 

.. 0 

«11 

032 

033 • ' 

.. 0 


Oral 

O n 2 • ■ 

• • 

0ll(022 • • 

• Oran) 



by the induction hypothesis. 

If A is upper triangular, equation 4.1 remains true and the proof is again 
an exercise in induction, with the slight difference that the column version 
of theorem 4.0.1 is needed. 

REMARK 4.0.1 It can be shown that the expanded form of the determi- 
nant of an n x n matrix A consists of n\ signed products ±au 1 a 2 i 2 ■ ■ ■ a n i n , 
where (ii, 12 , ■ ■ ■ , i n ) is a permutation of (1, 2, . . . , n), the sign being 1 or 
— 1, according as the number of inversions of (i 1 , * 2 , • • • , in) is even or odd. 
An inversion occurs when i r > i s but r < s. (The proof is not easy and is 
omitted.) 

The definition of the determinant of an n x n matrix was given in terms 
of the first-row expansion. The next theorem says that we can expand 
the determinant along any row or column. (The proof is not easy and is 
omitted.) 
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det A = 1 y + i aijMjj(A) 

j = 1 

for i = 1, . . . , n (the so-called i-th row expansion) and 

n 

det A = 1 ) l+ j aijMjj(A) 

i= 1 

for j = 1 , . . . , n (the so-called j-th column expansion) . 

REMARK 4.0.2 The expression (— l) l+3 obeys the chess-board pattern 
of signs: 

' + - + ' 

- + -... 

+ - + ... ' 


The following theorems can be proved by straightforward inductions on 
the size of the matrix: 

THEOREM 4.0.4 A matrix and its transpose have equal determinants; 
that is 

det A 1 = det A. 

THEOREM 4.0.5 If two rows of a matrix are equal, the determinant is 
zero. Similarly for columns. 

THEOREM 4.0.6 If two rows of a matrix are interchanged, the determi- 
nant changes sign. 

EXAMPLE 4.0.2 If Pi = (x\, yi) and P -2 = (x 2 , 2 / 2 ) are distinct points, 
then the line through Pi and P 2 has equation 



x y 1 

xi yi 1 

X2 2/2 1 
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For, expanding the determinant along row 1 , the equation becomes 

ax + by + c = 0, 


where 


a = 


V l 1 
V2 1 


= 2/1 — 2/2 and b = - 


X\ 1 
22 1 


= 2 2 - X'l . 


This represents a line, as not both a and b can be zero. Also this line passes 
through Pi, i = 1 , 2 . For the determinant has its first and z-th rows equal 
if 2 = 2,; and y = yi and is consequently zero. 

There is a corresponding formula in three-dimensional geometry. If 
Pi, P2, P3 are non-collinear points in three-dimensional space, with Pj = 
(2 i, yi, Zi ), z = 1 , 2 , 3 , then the equation 


X 

y 


1 

21 

yi 

Zi 

1 

22 

y -2 

Z2 

1 

X3 

2/3 

Z3 

1 



represents the plane through Pi, P2, P3. For, expanding the determinant 
along row 1, the equation becomes ax + by + cz + d = 0, where 



yi zi 1 


21 Z\ 1 


21 2/1 1 

a = 

2/2 Z 2 1 

, b = - 

22 Z2 1 

■> c = 

22 2/2 1 


2/3 z 3 1 


23 z 3 1 


x 3 2/3 1 


As we shall see in chapter 6, this represents a plane if at least one of a, b, c 
is non-zero. However, apart from sign and a factor the determinant 
expressions for a, b, c give the values of the areas of projections of triangle 
P1P2P3 on the ( y , z), (2, z) and (2, y) planes, respectively. Geometrically, 
it is then clear that at least one of a, b, c is non-zero. It is also possible to 
give an algebraic proof of this fact. 

Finally, the plane passes through Pj, i = 1 , 2 , 3 as the determinant has 
its first and z-th rows equal if 2 = 2 i, y = yi, z = z^ and is consequently 
zero. We now work towards proving that a matrix is non-singular if its 
determinant is non-zero. 


DEFINITION 4.0.3 (Cofactor) The (z, j) cofactor of A, denoted by 
Cij(A) (or C'ij if there is no ambiguity) is defined by 

Cij(A) = (-1 y+iMijiA). 
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REMARK 4.0.3 It is important to notice that Cjj(A), like M l j(A) , does 
not depend on aij. Use will be made of this observation presently. 

In terms of the cofactor notation, Theorem 3.0.2 takes the form 

THEOREM 4.0.7 

n 

det A = ^2 aijGij(A) 

3 = 1 

for i = 1 , . . . , n and 

n 

det A = ^2 a ijCij(A ) 

i = 1 

for j = 1, . . . , n. 

Another result involving cofactors is 
THEOREM 4.0.8 Let A be an n x n matrix. Then 

n 

(a) Y2 aijCkj(A) =0 if i ± k. 
j = 1 

Also 

n 

(b) '2TcL ij Ci k (A)=0 if j±k. 

i = 1 

Proof. 

If A is n x n and i ^ k, let B be the matrix obtained from A by replacing 
row k by row i. Then det B = 0 as B has two identical rows. 

Now expand det B along row k. We get 

n 

0 = det B = Y2 bkjCkj(B) 

3 = i 

n 

— ' y y ^ &jjC k j (A) , 

3 = 1 


in view of Remark 4.0.3. 
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DEFINITION 4.0.4 (Adjoint) If A = [o n j\ is an n x n matrix, the ad- 
joint of A, denoted by adj A, is the transpose of the matrix of cofactors. 
Hence 

' C n C 2 1 • • • C nl ' 

C\2 C 22 • • • c n 2 

adj A = . . 

L Ci n c 2 n * * * C n n J 

Theorems 4.0.7 and 4.0.8 may be combined to give 
THEOREM 4.0.9 Let A be an n x n matrix. Then 
A(adj A) = (det A)I n = (adj A) A. 

Proof. 

n 

(• A adj A) ik = ^2 a ij (adj -4) jk 
3=1 

n 

= ^ y ^ ij Cfcj ( A) 

3=1 

= 5i k det A 
= ((det A)I n ) ik . 



5 

6 

-2 4 

6 

+ 3 4 

5 

00 

9 

1 

00 

9 

8 

8 


= -3 + 24-24 
= -3/0. 


det A 
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Also 


A" 1 


C n C 2 i C 31 

C 12 C 22 C 32 

C*13 C*23 C33 




5 

6 


2 

3 


2 

3 




8 

9 


8 

9 


5 

6 


1 


4 

6 


1 

3 


1 

3 


3 


8 

9 


8 

9 


4 

6 




4 

5 


1 

2 


1 

2 




8 

8 


8 

8 


4 

5 



-3 6 -3 

12 -15 6 

-8 8 -3 


The following theorem is useful for simplifying and numerically evaluating 
a determinant. Proofs are obtained by expanding along the corresponding 
row or column. 


THEOREM 4.0.10 The determinant is a linear function of each row and 
column. 

For example 


Oil + 

012 + o!\ 2 

013 + 0^3 


an 

012 

013 


a ii 

a 12 

a 13 

021 

022 

023 

= 

021 

022 

023 

+ 

021 

022 

023 

031 

032 

033 


031 

032 

033 


031 

032 

033 


tan 

tai2 

fOi3 


Oil 

012 

013 

021 

022 

023 

= t 

021 

022 

023 

031 

032 

033 


031 

032 

033 


COROLLARY 4.0.2 If a multiple of a row is added to another row, the 
value of the determinant is unchanged. Similarly for columns. 


Proof. We illustrate with a 3 x 3 example, but the proof is really quite 
general. 


an + ta 21 

ai2 + ttt22 

013 + ^«23 


Oil 

012 

013 


f021 

ta22 

ttt23 

021 

022 

023 

= 

021 

022 

023 

+ 

021 

022 

023 

031 

032 

033 


031 

032 

033 


031 

032 

“33 
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Oil 

012 

013 


021 

022 

023 

= 

0 21 

022 

023 

+ t 

021 

022 

023 


«3 1 

032 

033 


031 

032 

033 


Oil 

012 

013 





= 

0-21 

022 

023 

+ 1 x 0 




03 1 

032 

033 






Oil 

012 

013 





= 

021 

022 

023 






«3 1 

032 

033 






To evaluate a determinant numerically, it is advisable to reduce the matrix 
to row-echelon form, recording any sign changes caused by row interchanges, 
together with any factors taken out of a row, as in the following examples. 

EXAMPLE 4.0.4 Evaluate the determinant 

1 2 3 
4 5 6. 

8 8 9 


Solution. Using row operations R 2 — > R 2 — 4i?i and R 3 — > R% — 8 R± and 
then expanding along the first column, gives 


1 

2 

3 


1 

2 

3 

4 

5 

6 

= 

0 

-3 

-6 

8 

8 

9 


0 

-8 

-15 




= - 

-3 

1 

-8 

2 

-15 


-3 -6 

-8 -15 


-3 


1 2 
0 1 


= -3. 


EXAMPLE 4.0.5 Evaluate the determinant 

112 1 
3 14 5 
7 6 12' 
113 4 


1 

1 

2 

1 


1 

1 

2 

1 

3 

1 

4 

5 


0 

-2 

-2 

2 

7 

6 

1 

2 


0 

-1 

-13 

-5 

1 

1 

3 

4 


0 

0 

1 

3 


Solution. 
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= -2 


1 

0 

0 

0 


1 2 1 

1 1 -1 

-1 -13 -5 

0 1 3 


= -2 


11 2 1 
0 1 1-1 

0 0 -12 -6 

0 0 13 


= 2 


= 2 


11 2 1 
0 1 1-1 

0 0 13 

0 0 -12 -6 

112 1 I 



0 0 0 30 


EXAMPLE 4.0.6 (Vandermonde determinant) Prove that 


1 1 1 

a b c 

a 2 b 2 c 2 


( b — a) (c — a)(c — b). 


Solution. Subtracting column 1 from columns 2 and 3 , then expanding 
along row 1, gives 


1 

1 

1 


1 0 0 

a 

b 

c 

= 

a b — a c — a 

a 2 

b 2 

c 2 


a 2 b 2 — a 2 c 2 — a 2 


b — a c — a 
b 2 - a 2 c 2 - a 2 


= (b — a)(c — a ) 


1 1 

b + a c + a 


( b — a)(c — a)(c — b). 


REMARK 4.0.4 From theorems 4.0.6, 4.0.10 and corollary 4.0.2, we de- 
duce 


(a) det ( EijA ) = — det A, 

(b) det (Ei(t)A) = fdet A, if t / 0, 
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(c) det (Eij(t)A) =det A. 

It follows that if A is row-equivalent to B, then det B = cdet A, where c 7 ^ 0. 
Hence det B/Ott det A / 0 and det B = 0 44 det A = 0. Consequently 
from theorem 2.5.8 and remark 2.5.7, we have the following important result: 

THEOREM 4.0.11 Let A be an n x n matrix. Then 

(i) A is non-singular if and only if det 4/ 0; 

(ii) A is singular if and only if det A = 0; 

(iii) the homogeneous system AX = 0 has a non-trivial solution if and 
only if det A = 0. 

EXAMPLE 4.0.7 Find the rational numbers a for which the following 
homogeneous system has a non-trivial solution and solve the system for 
these values of a: 


x — 2y + 3z = 0 
ax + 3y + 2z = 0 
6 x + y + az = 0 . 

Solution. The coefficient determinant of the system is 



1 

-2 

3 


1 

-2 

3 

A = 

a 

3 

2 

= 

0 3 + 2a 

2 — 3a 


6 

1 

a 


0 

13 

a - 18 


3 + 2a 2 -3a 
13 a -18 

= (3 + 2a) (a — 18) — 13(2 — 3a) 

= 2a 2 + 6a-80 = 2(a + 8)(a-5). 

SoA = 044a = — 8 or a = 5 and these values of a are the only values for 
which the given homogeneous system has a non-trivial solution. 

If a = — 8 , the coefficient matrix has reduced row-echelon form equal to 

'1 0 -1 ' 

0 1 -2 
0 0 0 
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and so the complete solution is x = z, y = 2 z, with z arbitrary. If a = 5, 
the coefficient matrix has reduced row-echelon form equal to 

'10 1 ' 

0 1 -1 
_ 0 0 0 _ 

and so the complete solution is x = —z, y = z, with z arbitrary. 

EXAMPLE 4.0.8 Find the values of t for which the following system is 
consistent and solve the system in each case: 

x + y = 1 

tx + y = t 
(l + t)x + 2y = 3. 

Solution. Suppose that the given system has a solution (xo, yo)- Then the 
following homogeneous system 


x + y + z = 0 

tx + y + tz = 0 

(1 + t)x + 2y + 3z = 0 

will have a non-trivial solution 


x = xq , y = yo, z = — 1 - 


Hence the coefficient determinant A is zero. However 



1 

1 

1 


1 

0 

0 


A = 

t 

1 

t 


t 

1 -t 

o 

= 


1 + t 

2 

3 


1 + 1 

1 -t 2-t 



Hence t = 1 or t = 2. If t = 1, the given system becomes 


(1— f)(2— t). 


x + y = 1 

x + y = 1 

2x + 2y = 3 


which is clearly inconsistent. If t = 2, the given system becomes 


x + y 
2 x + y 
3x + 2 y 


1 

2 

3 
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which has the unique solution x = 1, y = 0. 

To finish this section, we present an old (1750) method of solving a 
system of n equations in n unknowns called Cramer’s rule . The method is 
not used in practice. However it has a theoretical use as it reveals explicitly 
how the solution depends on the coefficients of the augmented matrix. 

THEOREM 4.0.12 (Cramer’s rule) The system of n linear equations 
in n unknowns x\, ... ,x n 

^llX\ T ^12*^2 4“ * * * 4“ CLi n X n — ^1 
0*21X1 4 " 022X2 4 " ' • • 4 - Q2nX n = &2 


OniXi 4~ o n 2 X 2 4“ * * * 4~ ei nn x n — b n 
has a unique solution if A = det [a nj \ / 0, namely 

_ Ai _ A2 _ A n 

X 1 — , X2 — , • • • 1 X n — , 

where A,; is the determinant of the matrix formed by replacing the i-th 
column of the coefficient matrix A by the entries 61, b 2 , ... ,b n . 


Proof. Suppose the coefficient determinant A/0. Then by corollary 4.0.1, 
H _1 exists and is given by A _1 = ^ adj A and the system has the unique 
solution 


Xi 


' h ■ 


' Cn 

C21 • 

• • C nl ' 


' 6l ' 

X2 

= H- 1 

62 

1 

Cl 2 

C22 • 

•• C n 2 


b 2 






A 






Xn 


bn 


Cm 

c 2n • 

C n n 


bn 


1 

A 


b\C\l 4- b 2 C 2 l + • • • 4- bnC'nl 
b>2Ci2 4- &2C22 4- ... 4- b n C n 2 


bnC'in 4- b 2 C2 n 4" • • • 4" b n C n 


However the i-th component of the last vector is the expansion of A,; along 
column i. Hence 


X\ 


' Aj ' 


' Ai/A ' 

X2 

1 

“ A 

A2 

= 

a 2 /a 

X n 


. A « . 


. A n / A _ 
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4.1 PROBLEMS 


1. If the points Pi = ( 2 j, yf), i = 1, 2, 3, 4 form a quadrilateral with ver- 
tices in anti-clockwise orientation, prove that the area of the quadri- 
lateral equals 


1 ( 

Xl 

X2 

x 2 

x 3 

x 3 

24 

2 4 

2i 

— ( 



+ 


+ 


+ 


2 V 

yi 

V2 

V2 

2/3 

2/3 

2/4 

2/4 

2/1 


(This formula generalizes to a simple polygon and is known as the 
Surveyor’s formula.) 

2. Prove that the following identity holds by expressing the left-hand 
side as the sum of 8 determinants: 


a + x 

b + y 

c + z 

a 

b 

c 

x + u 

y + v 

z + w =2 

X 

y 

z 

u + a 

v + b 

w + c 

u 

V 

UJ 


3. Prove that 

n 2 (n + l) 2 (n + 2) 2 

(n + 1) 2 (n + 2) 2 (n + 3) 2 = — 8. 

(n + 2) 2 (n + 3) 2 (n + 4) 2 

4. Evaluate the following determinants: 






1 

2 

3 

4 

246 

427 

327 


o 

1 

-4 

A 

Q 

1014 

543 

443 

(b) 

— Z 

3 

— 4 

-1 

O 

2 

-342 

721 

621 




-1 





4 

3 

-2 


[Answers: (a) —29400000; (b) 900.] 

5. Compute the inverse of the matrix 

'1 0 -2 ' 

A= 3 1 4 

_ 5 2 -3 

by first computing the adjoint matrix. 

' -11 -4 2 ' 

[Answer: A -1 = ™ 29 7 —10 .] 

L 1 -2 1 _ 
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6. Prove that the following identities hold: 


2 a 

2b 

b — c 

2b 

2 a 

a + c 

a + b 

a + b 

b 

b + c 

b 

c 

c 

c + a 

a 

b 

a 

a + b 


—2 (a - b) 2 {a + b ), 


2 a(b 2 + c 2 ). 


7. Let Pi = (xi, yi), i = 1, 2, 3. If x±, X 2 , x% are distinct, prove that there 
is precisely one curve of the form y = ax 2 + bx + c passing through 
-Pi, P2 and P3. 


8. Let 


A 


1 1 -1 
2 3 k 
1 k 3 


Find the values of k for which det A = 0 and hence, or otherwise, 
determine the value of k for which the following system has more than 
one solution: 


x + y — z = 1 

2x + 3 y + kz = 3 
x + ky + 3z = 2. 

Solve the system for this value of k and determine the solution for 
which x 2 + y 2 + z 2 has least value. 

[Answer: k = 2; x = 10/21, y = 13/21, z = 2/21.] 

9. By considering the coefficient determinant, find all rational numbers a 
and b for which the following system has (i) no solutions, (ii) exactly 
one solution, (iii) infinitely many solutions: 

x — 2 y + bz = 3 
ax + 2z = 2 

5x + 2 y = 1 . 

Solve the system in case (iii). 

[Answer: (i) ab = 12 and a / 3, no solution; ab / 12, unique solution; 
a = 3, b = 4, infinitely many solutions; x = — |z+ y = \z— g, with 
z arbitrary.] 
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10. Express the determinant of the matrix 

112 1 

12 3 4 

2 4 7 2f + 6 

2 2 6 -t t 

as as polynomial in t and hence determine the rational values of t for 
which B~ l exists. 

[Answer: det B = (t — 2)(2f — 1); t / 2 and t / ^.] 

11 . If A is a 3 x 3 matrix over a field and det 4/0, prove that 

(i) det(adjA) = (det A) 2 , 

(ii) (adjA )- 1 = ^-^A = adj (A^ 1 ). 

12 . Suppose that A is a real 3x3 matrix such that A* A = 1 3 . 

(i) Prove that A*(A — I3) = —(A — I 3 Y. 

(ii) Prove that det A = ±1. 

(iii) Use (i) to prove that if det A = 1, then det (A — I 3 ) = 0. 

13. If A is a square matrix such that one column is a linear combination of 
the remaining columns, prove that det A = 0. Prove that the converse 
also holds. 

14. Use Cramer’s rule to solve the system 

— 2x + 3 y — z = 1 

x + 2y — z = 4 

—2x — y + z = —3. 

[Answer: x = 2, y = 3, z = 4.] 

15. Use remark 4.0.4 to deduce that 

det Eij = — 1, det E t (t) = t, det Eij(t) = 1 
and use theorem 2.5.8 and induction, to prove that 

det (BA) = det B det A, 

if B is non-singular. Also prove that the formula holds when B is 
singular. 
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16. Prove that 


a + b + c 
a + b 
a 
a 


a + b 
a + b + c 
a 
a 


a 

a 

a + b + c 
a + b 


a 

a 

a + b 
a + b + c 


c 2 (2b + c)(4a + 2b+c). 


17. Prove that 


1 + U\ 

Ui 

U\ 

Ul 

U2 

1 + U2 

U2 

U2 

u-s 

U3 

1 + U 3 

U3 

U4 

U\ 

U4 

1 + U4 


I + U1 + U2 + U3 + U4. 


18. Let A £ M nxn (F). If A * = —A, prove that det A = 0 if n is odd and 
1 + 1 / 0 in F. 


19. Prove that 


1111 
rill 
r r 11 

r r r 1 


(l-r) 3 . 


20. Express the determinant 


1 a 2 — be a 4 
1 b 2 — ca b 4 
1 c 2 — ab c 4 


as the product of one quadratic and four linear factors. 

[Answer: (6 — a)(c — a)(c — b)(a + b + c)(b 2 + be + c 2 + ac + ab + a 2 ).] 



Chapter 5 

COMPLEX NUMBERS 


5.1 Constructing the complex numbers 


One way of introducing the field C of complex numbers is via the arithmetic 
of 2 x 2 matrices. 

DEFINITION 5.1.1 A complex number is a matrix of the form 


where x and y are real numbers. 
Complex numbers of the form 


x -y 
V x 


x 0 
0 x 


are scalar matrices and are called 


real complex numbers and are denoted by the symbol {x}. 

The real complex numbers {x} and {y} are respectively called the real 

x -y 


part and imaginary part of the complex number 
The complex number 
We have the identities 


0 -1 

1 0 


y x 

is denoted by the symbol i. 


X -y 


X 

0 ' 

+ 

' 0 

-y 


X 

0 ' 

+ 

' 0 

-1 ' 


y 

o ' 

y x _ 


0 

X 

. y 

0 


0 

X 

1 

0 


0 

y . 


= {x} + i{y}, 


i 2 = 


— i 

0 

1 

1 — 1 

1 


' 0 -1 ' 


1 

o 

T— 1 

1 

L 1 0 J 


1 0 


L o -i j 


= {-!}• 
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Complex numbers of the form i{y}, where y is a non-zero real number, are 
called imaginary numbers. 

If two complex numbers are equal, we can equate their real and imaginary 
parts: 

{xi} + i{yi} = {x 2 } + i{y 2 } =>■ x± = x 2 and y\ = y 2 , 

if xi, x 2 , yi, y 2 are real numbers. Noting that {0} + ?'{0} = {0}, gives the 
useful special case is 

{x} + i{y} = {0} x = 0 and y = 0, 

if x and y are real numbers. 

The sum and product of two real complex numbers are also real complex 
numbers: 

M + M = {x + y}, MM = {xy}. 

Also, as real complex numbers are scalar matrices, their arithmetic is very 
simple. They form a field under the operations of matrix addition and 
multiplication. The additive identity is {0}, the additive inverse of {x} is 
{— x}, the multiplicative identity is {1} and the multiplicative inverse of {x} 
is {x -1 }. Consequently 

{x} - {y} = {x} + (-{y}) = {x} + {-y} = {x - y}, 

= MM -1 = MG/ -1 } = { xy _1 } = {“} • 

It is customary to blur the distinction between the real complex number 
{x} and the real number x and write {x} as x. Thus we write the complex 
number {x} + i{y} simply as x + iy. 

More generally, the sum of two complex numbers is a complex number: 

(xi + iyi) + (x 2 + iy 2 ) = (xi + x 2 ) + i(yi + y 2 ); (5.1) 

and (using the fact that scalar matrices commute with all matrices under 
matrix multiplication and {— 1}A = —A if A is a matrix), the product of 
two complex numbers is a complex number: 

(xi + iyi)(x 2 + iy 2 ) = x x (x 2 + iy 2 ) + (%)(x 2 + iy 2 ) 

= xix 2 + xi(*y 2 ) + (iyi)x 2 + ( iyi)(iy 2 ) 

= xix 2 + ixiy 2 + zyix 2 + i 2 yiy 2 
= (xix 2 + {— l}yiy 2 ) + i(x\y 2 + yix 2 ) 

= (xix 2 - yiy 2 ) + i(x\y 2 + yix 2 ), 


(5.2) 
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The set C of complex numbers forms a field under the operations of 
matrix addition and multiplication. The additive identity is 0, the additive 
inverse of x + iy is the complex number (— x) + the multiplicative 

identity is 1 and the multiplicative inverse of the non-zero complex number 
x + iy is the complex number u + iv, where 


u = — ^ and v 
x z + y z 



(If x + iy / 0, then x / 0 or y / 0, so x 2 + y 2 / 0.) 

From equations 5.1 and 5.2, we observe that addition and multiplication 
of complex numbers is performed just as for real numbers, replacing i 2 by 
— 1, whenever it occurs. 

A useful identity satisfied by complex numbers is 


r 2 + s 2 = (r + is)(r — is). 


This leads to a method of expressing the ratio of two complex numbers in 
the form x + iy, where x and y are real complex numbers. 


x\ + iyi _ (xi +iyi)(x 2 - jyg) 

X 2 + iy 2 (x 2 +iy 2 )(x 2 - iy 2 ) 

(■ X\X 2 + mm) + i(-xiy 2 + yix 2 ) 
x\ + yl 


The process is known as rationalization of the denominator. 


5.2 Calculating with complex numbers 

We can now do all the standard linear algebra calculations over the field of 
complex numbers - find the reduced row-echelon form of an matrix whose el- 
ements are complex numbers, solve systems of linear equations, find inverses 
and calculate determinants. 

For example, 

= (1 + z)(8 — 2i) — 7(2 — i) 

= (8 - 2 i) + i{ 8 - 2 i) -14 + 7* 

= -4+13i/0. 


1 + i 2 — i 
7 8-2 i 
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Then by Cramer’s rule, the linear system 

(1 + i)z + (2 — i)w = 2 + 7 ? 

7z + (8 — 2 i)w = 4 — 9 ? 

has the unique solution 

2 + 7i 2 - ? 

4 — 9 % 8 — 2 ? 

—4 + 13?' 

(2 + 7?) (8 - 2 ?) - (4 - 9i)(2 - ?) 

-4 + 13* 

2(8 - 2 ?) + (7?)(8 - 2*) - {(4(2 - ?) - 97(2 - ?)} 

-4 + 13? 

16 - 4? + 56? - 14? 2 - {8 - 4? - 18? + 9? 2 } 

-4 + 13? 

31 + 74? 

-4 + 13? 

(31 + 74?)(-4- 13?) 

(-4) 2 + 13 2 
838 - 699? 

(-4) 2 + 13 2 
838 699 . 

185 _ 185* 

, . , -698 229. 

and similarly w = 1 ?. 

185 185 

An important property enjoyed by complex numbers is that every com- 
plex number has a square root: 

THEOREM 5.2.1 

If w is a non-zero complex number, then the equation z 2 = w has precisely 
two solutions z£ C. 

Proof. Case 1. Suppose 6 = 0. Then if a > 0, z = \fa is a solution, while 
if a < 0, ?\/— a is a solution. 

Case 2. Suppose 6/0. Let z = x + iy, w = a + ?6, x, y, a, 6 £ M. Then 
the equation z 2 = w becomes 


( x + iy) 2 = x 2 — y 2 + 2 xyi = a + ib, 
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so equating real and imaginary parts gives 


x 2 — y 2 = a and 2 xy = b. 


Hence x^O and y = b/(2x). Consequently 

b\ 2 


x 2 — 


2x 


= a, 


so 4x 4 — 4 ax 2 — b 2 = 0 and 4(x 2 ) 2 — 4a(x 2 ) — b 2 = 0. Hence 


x 2 = 


4 a ± \/l6 a 2 + 166 2 a ± \/ a 2 + b 2 


However x 2 > 0, so we must take the + sign, as a — \/ a 2 + b 2 < 0. Hence 


2 a + V a 2 + b 2 

x = , x = ±\ 


+ V a 2 + b 2 


Then y is determined by y = b/(2x). 

EXAMPLE 5.2.1 Solve the equation z 2 = 1 + i. 
Solution. Put z = x + iy. Then the equation becomes 
(x + iy) 2 = x 2 - y 2 + 2 xyi = 1 + i, 
so equating real and imaginary parts gives 

x 2 — y 2 = 1 and 2 xy = 1. 
Hence x / 0 and y = b/{2x). Consequently 

1 


*-—=!> 

' 2x 1 


so 4.x 4 — 4x 2 — 1 = 0. Hence 


x 2 = 


4± v/16 + 16 1 ± v/2 


Hence 


2 1 + V^ ^ 

1 and * = • 
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Then 

y = — = ± / 

2 * V2y/l + V2 

Hence the solutions are 


EXAMPLE 5.2.2 Solve the equation z 2 + (\/3 + i)z +1 = 0. 

Solution. Because every complex number has a square root, the familiar 
formula 

—b ± \/b 2 — 4 ac 


for the solution of the general quadratic equation az 2 + bz + c = 0 can be 
used, where now a(/ 0), b, c G C. Hence 

-{V3 + i)±J(V3 + i) 2 -4: 

2 ~ 2 

-( V3 + i) ± ^/(3 + 2\/3i — 1) — 4 

2 

-(v/3 + i) ± v/-2 + 2v/3i 

2 ’ 

Now we have to solve re 2 = —2 + Put w = x + iy. Then w 2, = 

x 2 — y 2 + 2xyi = — 2 + 2\/3i and equating real and imaginary parts gives 
x 2 — y 2 = —2 and 2xy = 2\/3. Hence y = and so x 2 — 3/x 2 = —2. So 

x 4 + 2x 2 — 3 = 0 and (x 2 + 3)(x 2 — 1) = 0. Hence x 2 — 1 = 0 and x = ±1. 
Then y = ±y / 3. Hence (1 + v/3^) 2 = —2 + 2 \/3'i and the formula for z now 
becomes 


z 


— \/3 — i ± (1 + \/3 i) 

2 

1 - V3 + (1 + y/3)i -1 - y/% - (1 + V3)i 

or 

2 2 


EXAMPLE 5.2.3 Find the cube roots of 1. 
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Solution. We have to solve the equation z 3 = 1, or z 3 — 1 = 0. Now 
z 3 — 1 = (z — l)(z 2 + z + 1). So z 3 — 1 = 0 =>- z — 1 = 0 or z 2 + z + 1 = 0. 


But 


z 2 + z + l = 0^z = 


-1 ± v/1 2 -4 
2 


-1 ± VSi 

2 


So there are 3 cube roots of 1, namely 1 and (—1 ± \/3*)/2. 

We state the next theorem without proof. It states that every non- 
constant polynomial with complex number coefficients has a root in the 
field of complex numbers. 


THEOREM 5.2.2 (Gauss) If f(z) = a n z n + a n -iz n 1 + • • • + a\z + ao, 
where a n / 0 and n > 1, then f(z) = 0 for some z G C. 


It follows that in view of the factor theorem, which states that if a € F is 
a root of a polynomial f(z) with coefficients from a field F, then z — a is a 
factor of f(z), that is f(z) = (z — a)g(z), where the coefficients of g(z) also 
belong to F. By repeated application of this result, we can factorize any 
polynomial with complex coefficients into a product of linear factors with 
complex coefficients: 


f(z) = a n (z - zi)(z -z 2 )---(z- z n ). 


There are available a number of computational algorithms for finding good 
approximations to the roots of a polynomial with complex coefficients. 


5.3 Geometric representation of C 

Complex numbers can be represented as points in the plane, using the cor- 
respondence x + iy (x, y) . The representation is known as the Arganct 
diagram or complex plane. The real complex numbers lie on the x-axis, 
which is then called the real axis, while the imaginary numbers lie on the 
y-axis, which is known as the imaginary axis. The complex numbers with 
positive imaginary part lie in the upper half plane, while those with negative 
imaginary part lie in the lower half plane. 

Because of the equation 

(xi + iyi) + (x 2 + * 2 / 2 ) = (xi + x 2 ) + i(y 1 + y 2 ), 

complex numbers add vectorially, using the parallellogram law. Similarly, 
the complex number z\ — z 2 can be represented by the vector from [x 2 , y 2 ) 
to (xi, y\), where z\ = x\ + iy\ and z 2 = x 2 + iy 2 . (See Figure 5.1.) 
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Figure 5.1: Complex addition and subraction. 


The geometrical representation of complex numbers can be very useful 
when complex number methods are used to investigate properties of triangles 
and circles. It is very important in the branch of calculus known as Complex 
Function theory, where geometric methods play an important role. 

We mention that the line through two distinct points Pi = (aq, yi) and 
P -2 = ( x 2 , 2 / 2 ) has the form z = (1 — t)zi + tz 2 , i 6 M, where z = x + iy is 
any point on the line and z\ = x t + iyt, i = 1, 2. For the line has parametric 
equations 

X = (1 - t)xi +tx 2 , y= (1 -t)yi +ty 2 

and these can be combined into a single equation z = (1 — t)z\ + tz 2 . 

Circles have various equation representations in terms of complex num- 
bers, as will be seen later. 

5.4 Complex conjugate 

DEFINITION 5.4.1 (Complex conjugate) If z = x + iy, the complex 
conjugate of z is the complex number defined by z = x — iy. Geometrically, 
the complex conjugate of z is obtained by reflecting z in the real axis (see 
Figure 5.2). 

The following properties of the complex conjugate are easy to verify: 
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Figure 5.2: The complex conjugate of z: z. 


1. z\ + Z 2 = z7 + z^; 

2. —z = — z. 

3 . z\ - Z 2 = z7~W, 

4 . zTM = z7W, 

5. (1 /z) = l/z) 

6 . {Z\/Z2) = zi/z2] 

7. z is real if and only if z = z; 


8. With the standard convention that the real and imaginary parts are 
denoted by Re z and Irn z, we have 

z + z 


Re z = 


Im z = 


z — z 

2 i ’ 


9. If z = x + iy , then zz = x 2 + y 2 . 


THEOREM 5.4.1 If f(z) is a polynomial with real coefficients, then its 
non-real roots occur in complex-conjugate pairs, i.e. if f(z ) = 0, then 

m = o. 


Proof. Suppose f(z) = 

: a n z n + a n -\z n 1 + • • • 

+ a\z + a 0 = 

a n , ... ,ci o are real. Then 



0 = 0 = /(z) 

= a n z n + a n -\z n ~ l + • 

■ ■ + ajz + aT 


= a n z n + a n _i z n ~ l + 

h ajz + ajj 


= a n z n + a n -\z n 1 + • 

• • + ajz + ao 


= /(*)■ 



0, where 
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EXAMPLE 5.4.1 Discuss the position of the roots of the equation 




4 


= -1 


in the complex plane. 

Solution. The equation z 4 = —1 has real coefficients and so its roots come 
in complex conjugate pairs. Also if z is a root, so is — z. Also there are 
clearly no real roots and no imaginary roots. So there must be one root w 
in the first quadrant, with all remaining roots being given by iv, —w and 
— w. In fact, as we shall soon see, the roots lie evenly spaced on the unit 
circle. 

The following theorem is useful in deciding if a polynomial /(z) has a 
multiple root a; that is if (z — a) m divides f(z ) for some rn > 2. (The proof 
is left as an exercise.) 

THEOREM 5.4.2 If /(z ) = (z — a) m g(z ), where rn > 2 and g(z ) is a 
polynomial, then f'(a) = 0 and the polynomial and its derivative have a 
common root. 

From theorem 5.4.1 we obtain a result which is very useful in the explicit 
integration of rational functions (i.e. ratios of polynomials) with real coeffi- 
cients. 

THEOREM 5.4.3 If /(z) is a non-constant polynomial with real coeffi- 
cients, then /(z) can be factorized as a product of real linear factors and 
real quadratic factors. 

Proof. In general /(z) will have r real roots zi,...,z r and 2s non-real 
roots z r+ i, z r+ i, . . . , z r+s , z r+s , occurring in complex-conjugate pairs by 
theorem 5.4.1. Then if a n is the coefficient of highest degree in /(z), we 
have the factorization 

/(z) = a n (z — zi) • • • (z — z r ) X 

x(z ■Zr+l ) (^ -2-r-Tl) (z Z r ._|_ s )(z Z r -f- s ). 

We then use the following identity for j = r + 1 , . . , , r + s which in turn 
shows that paired terms give rise to real quadratic factors: 

(z — Zj)(z — Zj ) = Z 2 — (Zj + Zj)z + ZjZj 

= z 2 - 2 Re Zj + (x 2 + y 2 ), 

where Zj = Xj + iyj . 

A well-known example of such a factorization is the following: 
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Figure 5.3: The modulus of z: \z 


EXAMPLE 5.4.2 Find a factorization of z 4 + 1 into real linear and quadratic 
factors. 

Solution. Clearly there are no real roots. Also we have the preliminary 
factorization z 4 + 1 = ( z 2 — i)(z 2 + i). Now the roots of z 2 — i are easily 
verified to be ±(1 + i)/y/2, so the roots of z 2 + i must be ±(1 — i)/V 2. 
In other words the roots are w = (1 + i)/V 2 and w, —w, —W. Grouping 
conjugate-complex terms gives the factorization 

z 4 + 1 = (z — w)(z — w)(z + w)(z + w) 

= ( z 2 — 2zRe w + iuw) (z 2 + 2zRe w + ww ) 

= (z 2 - V2z+l)(z 2 + V2z + 1). 

5.5 Modulus of a complex number 

DEFINITION 5.5.1 (Modulus) If z = x + iy, the modulus of z is the 
non-negative real number \z\ defined by \z\ = \J x 1 + y 2 . Geometrically, the 
modulus of z is the distance from z to 0 (see Figure 5.3). 

More generally, \z± — Z 2 I is the distance between z\ and z 2 in the complex 
plane. For 


zi - z 2 \ = |(xi + iyi) - (x 2 + *y 2 )| = \(xi - x 2 ) + i(yi - y 2 )\ 

= V( x i - x ? ) 2 + ( 2/1 - 1J2) 2 ■ 


The following properties of the modulus are easy to verify, using the identity 
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(h) i^- 1 ! = id- 1 - 


ah) - = 

Z 2 \Z2\ 

For example, to prove (i): 

\ziz 2 \ 2 = (ziz 2 )z^ = (ziz 2 )X[ zi 

= (z 1 Zi)(z 2 Z 2 ) = I ^1 1 2 1 ^2 1 2 = ( | 1 1 - 2-2 | ) 2 • 

Hence \z\z 2 \ = | 1 1 ^ 2 1 - 

11 4- i) 4 

EXAMPLE 5.5.1 Find Id when 2 = ddld -. 

1 1 (1 + 6i)(2 — 7i) 

Solution. 


1 _ |l + i| 4 

' Z ' ~ |1 + 6i||2 — 7i\ 

(\ZlHh) 4 
Vl 2 + 6 2 y / 2 2 + (-7) 2 
4 

y/37y/53' 

THEOREM 5.5.1 (Ratio formulae) If z lies on the line through z\ and 

22: 

z = ( 1 — t)z\ + tz 2 , t g M, 

we have the useful ratio formulae: 

Z ~ Z1 f -r u 

(i) — = t if z + z 2 , 


z — Z\ 

t 

z - z 2 

1 -t 

n 

1 

= \t\. 

\Zl - z 2 | 



Circle equations. The equation | z — zq\ = r, where zq G C and r > 
0, represents the circle centre zq and radius r. For example the equation 
\z — (1 + 2i)\ = 3 represents the circle (x — l) 2 + (y — 2) 2 = 9. 

Another useful circle equation is the circle of Apollonius : 

I z — a I 
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Figure 5.4: Apollonius circles: |g§| = 1, L || f, f ’ I’ f 


where a and b are distinct complex numbers and A is a positive real number, 
A / 1. (If A = 1, the above equation represents the perpendicular bisector 
of the segment joining a and b.) 

An algebraic proof that the above equation represents a circle, runs as 
follows. We use the following identities: 

(i) \z — a\ 2 = \z\ 2 — 2Re (za) + |a| 2 

(ii) Re (zi ± Z 2 ) = Re z\ ± Re z 2 

(iii) Re ( tz ) = tRe z if t E M. 

We have 




z-b r 1 

2 — 2Re {za} 
2 \| „\2 2 


Z 


(1-A 2 )N : 


zl 2 - 2 Re 



fa — A 2 b 

VT^a^ 


= X 2 \z-b\ 2 

A 2 (|^| 2 — 2Re{^6} + |6| 2 ) 
- A 2 6)} = A 2 1 6| 2 - |a| 2 
_ \ 2 \b\ 2 - \a\ 2 


z I 2 — 2Re 



fa — \ 2 b\ 


a — A 2 b 

2 A 2 6 2 — a 2 

a — A 2 b 

1 — A 2 

1 - A 2 + 

1- A 2 
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Now it is easily verified that 


|a - X 2 b\ 2 + (1 - A 2 ) (A 2 1 6| 2 - |a| 2 ) = A 2 |a - b\ 2 . 

So we obtain 

2 X 2 \a — b\ 2 
= |1 — A 2 | 2 
A|o — b\ 

= \T^W\- 

The last equation represents a circle centre zo, radius r, where 

a — X 2 b A|a — b\ 


z — a 


z-b 


= X 45 


45 


Z — 


Z — 


a — X 2 b 
1- A 2 
a — A 2 b 
1- A 2 


There are two special points on the circle of Apollonius, the points z\ and 
z 2 defined by 


zi- a z 2 - a 

= A and - 

z\ — b z 2 — b 


-x, 


or 


zi 


a — Xb 
1 - A 


and 


^2 


cl T Xb 

1 + A 


(5.3) 


It is easy to verify that z\ and z 2 are distinct points on the line through a 
and b and that zq = Z1 \ Z2 ■ Hence the circle of Apollonius is the circle based 
on the segment z \ , z 2 as diameter. 


EXAMPLE 5.5.2 Find the centre and radius of the circle 
\z — 1 — i\ = 2| z — 5 — 2i\. 

Solution. Method 1. Proceed algebraically and simplify the equation 


x + iy — 1 — i\ = 2\x + iy — 5 — 2i\ 


or 

\x - 1 + i(y - 1)| = 2\x - 5 + i(y - 2)|. 
Squaring both sides gives 


{x - l) 2 + (y- l) 2 = 4((x - 5) 2 + (y- 2) 2 ), 


which reduces to the circle equation 
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Completing the square gives 


(x 






-38 = 


68 

~9~’ 


so the centre is (^, |) and the radius is y if' 

Method 2. Calculate the diametrical points z\ and Z 2 defined above by 
equations 5.3: 


z\ — 1 — i = 2(zi — 5 — 2 i) 

Z2 — 1 — i = —2(^2 — 5 — 2 i). 

We find zi = 9 + 3i and Z 2 = (11 + 5i)/3. Hence the centre zq is given by 

z\ + Z 2 19 7 . 

zo = — x — = xr + M 


and the radius r is given by 



/ 19 7 \ , 


1 8 2 1 

II 

0 

1 

II 

(y + 3V - (9+3?) 

— 

•<s> 

1 CO 

I 

1 CO 

1 


5.6 Argument of a complex number 

Let z = x + iy be a non-zero complex number, r = \z\ = y / x 2 + y 2 . Then 
we have x = r cos 9, y = r sin 9, where 9 is the angle made by z with the 
positive x-axis. So 9 is unique up to addition of a multiple of 27r radians. 

DEFINITION 5.6.1 (Argument) Any number 9 satisfying the above 
pair of equations is called an argument of z and is denoted by arg z. The 
particular argument of z lying in the range — n < 9 < it is called the principal 
argument of z and is denoted by Arg z (see Figure 5.5). 

We have z = r cos 9 + ir sin 9 = r( cos 9 + i sin 9) and this representation 
of z is called the polar representation or modulus-argument form of z. 

EXAMPLE 5.6.1 Argl = 0, Arg(— 1) = 7 r, Argi = Arg(— i) = — |. 

We note that y/x = tan# if x / 0, so 9 is determined by this equation up 
to a multiple of 7r. In fact 


Arg z = tan 


-l V 


x 


+ kn, 
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Figure 5 . 5 : The argument of z: aigz = 9 . 


where k = 0 if x > 0' k = 1 if x < 0, y > 0; k = — 1 if x < 0, y < 0. 

To determine Arg z graphically, it is simplest to draw the triangle formed 
by the points 0, x, z on the complex plane, mark in the positive acute angle 
a between the rays 0 , x and 0 , z and determine Arg z geometrically, using 
the fact that a = tan -1 (|y|/|x|), as in the following examples: 

EXAMPLE 5.6.2 Determine the principal argument of z for the followig 
complex numbers: 


z = 4 + 3 i, —4 + 3 i, —4 — 3 i, 4 — 3 i. 

Solution. Referring to Figure 5 . 6 , we see that Arg z has the values 

a, ir — a, —ir + a , —a, 


where a = tan 1 |. 

An important property of the argument of a complex number states that 
the sum of the arguments of two non-zero complex numbers is an argument 
of their product: 

THEOREM 5.6.1 If 6± and 02 are arguments of z± and Z 2 , then 9\ + 02 
is an argument of ziZ 2 - 

Proof. Let z\ and Z 2 have polar representations z\ = ri(cos 0 i + isin$i) 
and Z2 = r2(cos#2 + *sin02)- Then 

Z\Z2 = n (cos 6*i + i sin 0i)r2(cos 02 + i sin 02) 

= rir2(cos 9 \ cos 02 — sin 9 \ sin 02 + i( cos 9 \ sin 02 + sin 0 \ cos #2)) 

= rir 2 (cos ( 9 \ + 0 2 ) + isin (6*1 + 0 2 )), 
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Figure 5.6: Argument examples. 


which is the polar representation of Z\Z 2 , as rir 2 = |^i||^ 2 | = |~i~ 2 |- Hence 
9 1 + 02 is an argument of z±Z 2 ■ 

An easy induction gives the following generalization to a product of n 
complex numbers: 

COROLLARY 5.6.1 U6 1 ,...,0 n are arguments for zi, ... ,z n respectively, 
then 6 1 + ■ ■ ■ + 9 n is an argument for z\ ■ ■ ■ z n . 

Taking 9\ = ■ ■ ■ = 9 n = 9 in the previous corollary gives 

COROLLARY 5.6.2 If 9 is an argument of z, then n9 is an argument for 

z n . 

THEOREM 5.6.2 If 9 is an argument of the non-zero complex number 
z, then —9 is an argument of z~ l . 

Proof. Let 9 be an argument of z. Then z = r(cos 9 + i sin 0), where r = \z\. 
Hence 

z _1 = r _1 (cos6 ) + zsin#) -1 

= r _1 (cos6 ) — isin#) 

= r _1 (cos(— 9) + isin(— 9)). 
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Now r 1 = \z\ 1 = | z 1 |, so —6 is an argument of z 1 . 

COROLLARY 5.6.3 If 6\ and 62 are arguments of z\ and Z2, then 6\ —62 
is an argument of 21/22. 

In terms of principal arguments, we have the following equations: 


(i) 

Arg (2122) 

= Arg 2i+ Arg 2 2 + 2 Aq 7 r, 

(ii) 

Arg (2' 1 ) 

= —Arg 2 + 2 A: 2 7 T, 

(iii) 

Arg (21/22) 

= Arg 21 -Arg 22 + 2 A; 3 7 t, 

(iv) 

Arg (21 • • • 2 n ) 

= Arg 21 H I- Arg z n + 2 /c 4 7 t, 

(v) 

Arg (z n ) 

= n Arg 2 + 2 & 57 T, 


where k\ , k 2, ks, £q, k$ are integers. 


In numerical examples, we can write (i), for example, as 
Arg (2:122) = Arg 21 + Arg 22- 


EXAMPLE 5.6.3 Find the modulus and principal argument of 

17 


2 = 


n /3 + % 
1 + i 


and hence express 2 in modulus-argument form. 

\Vs + i \ 17 2 17 


Solution. 


z = 


1 + i 


»' 1 17 


(V 2 ) 


17 


= 2 17 / 2 . 


Arg 2 


17 Arg 


( a /3 + i \ 

[TTV) 


17 (Arg (Vs + i) - Arg (1 + *)) 



- 177 r 
12 


Hence Arg 2 = ( (^ ) + 2 k-K, where k is an integer. We see that k = 1 and 
hence Arg 2 = -j|. Consequently 2 = 2 17//2 (cos + * sin 

DEFINITION 5.6.2 If 6 is a real number, then we define e ld by 


e l ° = cos# + zsin#. 


More generally, if z = x + iy, then we define e z by 

e 2 = e x e iy . 
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For example, 

Z7T • j/K_ 

e 2 = *, e 1 = - 1, e * = -i. 

The following properties of the complex exponential function are left as 
exercises: 


THEOREM 5.6.3 

(i) e Zl e Z2 

= 

e zi+z 2 ^ 


(ii) e zi ■ ■ ■ e Zn 

= 

\~Z n 


(iii) e z 


0 , 


(iv) (e *) -1 

— 

e" z , 


(v) e Zl / e z 2 

— 

e Zl ~ Z2 , 


(vi) e z 

— 

e z . 

THEOREM 5.6.4 

The equation 




e z = 1 

has the complete solution z = 2kiri, k £ Z. 

Proof. First we observe that 

e 2km = cos (2/e7r) + ism (2A:7r) = 1. 

Conversely, suppose e z = 1, z = x + iy. Then e x (cos y + is'my) = 1. Hence 
e x cosy = 1 and e x smy = 0. Hence siny = 0 and so y = nir, n GZ. Then 
e x cos (n 7r) = 1, so e a; (— l) n = 1, from which follows (— l) n = 1 as e x > 0. 
Hence n = 2k, k £ Z and e x = 1. Hence x = 0 and z = 2/c7ri. 

5.7 De Moivre’s theorem 

The next theorem has many uses and is a special case of theorem 5.6.3(H). 
Alternatively it can be proved directly by induction on n. 

THEOREM 5.7.1 (De Moivre) If n is a positive integer, then 

(cos 6 + i sin 6) n = cos n6 + i sin nO. 

As a first application, we consider the equation z n = 1. 

THEOREM 5.7.2 The equation z n = 1 has n distinct solutions, namely 
the complex numbers Cfc = e~^~ , k = 0, 1, ...,n — 1. These lie equally 
spaced on the unit circle \z\ = 1 and are obtained by starting at 1, moving 
round the circle anti-clockwise, incrementing the argument in steps of — . 
(See Figure 5.7) 

27 Tl 

We notice that the roots are the powers of the special root ( = e ~ . 



108 


CHAPTER 5. COMPLEX NUMBERS 



Proof. With ^ defined as above, 


Q 


( Ik'Ki \ ^ 2/c7ri 

e—J = e — n = l, 


by De Moivre’s theorem. However |(L| = 1 and arg(L = so the com- 
plex numbers k = 0, 1, ... ,n — 1, he equally spaced on the unit circle. 
Consequently these numbers must be precisely all the roots of z n — 1. For 
the polynomial z n — 1, being of degree n over a field, can have at most n 
distinct roots in that field. 


The more general equation z n = a, where o€,C,a/0, can be reduced 
to the previous case: 

Let a be argument of z, so that a = |o|e*“. Then if w = \a\ l /' n e^ , we 
have 


w 


n 


(\a\^y 

dal 1 /-)- ( e S)" 


So w is a particular solution. Substituting for a in the original equation, 
we get z n = w n , or ( z/w) n = 1. Hence the complete solution is z/w = 
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Ih'Ki 

e n , k = 0, 1, . . 


n — 1, or 


Zk 


a 


1/n 


jOL 

e n e 


IhTxi 

n 


\ci\ 1/n e 


i(cx-\-2kir) 


(5.4) 


k = 0, 1, . . . , n — 1. So the roots are equally spaced on the circle 


and are generated from the special solution having argument equal to (arg a) / n, 
by incrementing the argument in steps of 2ir /n. (See Figure 5.8.) 

EXAMPLE 5.7.1 Factorize the polynomial z 5 — 1 as a product of real 
linear and quadratic factors. 

# 27ri — 27T i 4-7ri — 4-7T i t 

Solution. The roots are 1, e 5 , e 5 ,es ? e 5 ? using the fact that non- 
real roots come in conjugate-complex pairs. Hence 


z 5 -l = (z 


2-Ki —2iri 4-jri 

1 )(z - e s )(z — e s )( Z - e s 


){* 


—47 ri 

e 5 ). 


. 2-kx 

[z - e 5 


)(z-e 


— 27ri 
5 


) 


9 / 27T i —2-Ki 

z — z(e 5; + e s ) + 1 
z 2 - 2 2 : cos \ + L 


Now 
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Similarly 

(z - e s )( z - e s ) = z 2 - 2zcos nf + 1. 
This gives the desired factorization. 

EXAMPLE 5.7.2 Solve z 3 = i. 


Solution. |i| = 1 and Argi = | = a. So by equation 5.4, the solutions are 


Zk 


i(ai+2kir) 

i \ i/6 e 3 


k = 0, 1, 2. 


First, k = 0 gives 


47T 7T 7 T 

zo = e e = cos — hi sin — 
6 6 


\/3 i 
~Y + 2 ' 


Next, /c = 1 gives 


57ri 57T 57T 

zi = e 6 = cos hi sin — 

6 6 



Finally, k = 2 gives 


9™ 97T 97T 

q = e « = cos h zsm — = —i. 

6 6 


We finish this chapter with two more examples of De Moivre’s theorem. 


EXAMPLE 5.7.3 If 


C = 1 + cos 6 + • • • + cos (n — 1)0, 

S = sin 6* + • • • + sin (n — 1)0, 


prove that 


C 


sin f 


sin 


cos 


(n— 1)0 
2 


and S 


sin f 


sin 


sin 


(n— 1)0 
2 ’ 


if 6 yV 2A?7t, k G Z. 
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Solution. 


C + iS = 1 + (cos 9 + i sin 9) + • • • + (cos (n — 1 )9 + i sin (n — 1)0) 


1 + e id + 


J(n- 1)6 


1 + Z + 

1 ~ Z 


z n 1 , where z = e 


id 


1 - z 
1 - e ind 


, if z 7^ 1, i.e. 9 / 2A;7r, 


in# . —inO inO N 

(e 2 — 


1 — e 




/ —iQ ifl > 

e ^ (e^“ - e2 ) 


= e 


i(n-l)| Sm 2 

1 n~ 

sm % 


(cos (n — 1)| + isin {n — 1)|) — 


0 sin n % 


sin 


The result follows by equating real and imaginary parts. 


EXAMPLE 5.7.4 Express cos n9 and sin n9 in terms of cos 9 and sin0, 
using the equation cos n9 + sin n9 = (cos 9 + i sin 9) n . 

Solution. The binomial theorem gives 

(cos 9 + i sin 9) n = cos" 9 + (") cos" -1 9(i sin 9) + (2) cos" -2 9{i sin 0) 2 + • • • 
+ (isin0)". 

Equating real and imaginary parts gives 

cos n9 = cos" 9 — (2) cos" -2 9 sin 2 9 + • • • 

sin n9 = (") cos" -1 9 sin 9 — (!() cos" -3 9 sin 3 9 + ■ ■ ■ . 


5.8 PROBLEMS 

1. Express the following complex numbers in the form x + iy, x, y real: 

(i) (-3 + «)(14-2i); (ii) (iii) 

[Answers: (i) — 40 + 20i; (ii) — (iii) — \ + |-] 

2. Solve the following equations: 
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(i) iz + (2 — 10z)z = 3z + 2i, 

(ii) (1 + i)z + (2 — i)w = —3 i 

(1 + 2i)z + (3 + i)w = 2 + 2 i. 

[Answers: (i) z = (+) ^ = -1 + 5i, w = ^ 

3. Express 1 + (1 + i) + (1 + i) 2 + . . . + (1 + i) 99 in the form x + iy , x, y 
real. [Answer: (1 + 2 50 )i.] 

4. Solve the equations: (i) z 2 = — 8 — 6 i; (ii) z 2 — (3 + i)z + 4 + 3i = 0. 
[Answers: (i) z = ±(1 — 3i); (ii) z = 2 — i, 1 + 2 i] 

5. Find the modulus and principal argument of each of the following 
complex numbers: 

(i)4 + i; (ii) — | (iii)-l + 2i; (iv) \{-l + i\/3). 

[Answers: (i) \/l7, tan -1 (ii) — vr + tan -1 (iii) \/5, i r — 

tan -1 2.] 

6. Express the following complex numbers in modulus-argument form: 

(i) z=(l + i)(l + iv / 3)(v / 3-0- 

oil , _ t i+i >y-^> 5 . 

' (VS + i ) 1 


[Answers: 

(i) z = 4\/2(cos + isin ||); (ii) z = 2'/ 2 (cos ^ + i sin ^).] 

7. (i) If 2 = 2(cos | + zsin |) and w = 3(cos | + isin |), find the polar 

form of 

(a) zw ; (b) *;(c)f;(d) 

(ii) Express the following complex numbers in the form x + iy: 

(a) (l + i) 12 ;(b) (+)~“. 

[Answers: (i): (a) 6(cos || + isin ||); (b) |(cos j^+isin j^); 

(c) §(cos +isin -^); (d) f (cos + isin ^f); 

(ii) : (a) -64; (b) -*.] 
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8. Solve the equations: 

(i) z 2 = 1 + iVS', (ii) z 4 = i; (iii) z 3 = —81; (iv) z 4 = 2 — 21. 

[Answers: (i) z = ± ; (ii) i k ( cos | +lsin | ),k = 0, 1,2,3; (iii) 
z = 21, —a/3 — 1, a/3 — 1; (iv) 2 = ! fc 2§(cos yg — Isin yg), k = 0, 1,2,3.] 

9. Find the reduced row-echelon form of the complex matrix 

'2 + 1 -1 + 21 2 
1 + 1 —1 + 1 1 
1 + 21 — 2 + 1 1 + 1 

'110' 

[Answer: 0 0 1 .] 

_ 0 0 0 J 

10. (i) Prove that the line equation lx + my = n is equivalent to 

pz + pz = 2 n, 

where p = l + im. 

(ii) Use (ii) to deduce that reflection in the straight line 

pz + pz = n 

is described by the equation 

pw + pz = n. 

[Hint: The complex number l + im is perpendicular to the given 
line.] 

(iii) Prove that the line \z — a| = \z — b\ may be written as pz+pz = n, 
where p = b — a and n = |6| 2 — |a| 2 . Deduce that if z lies on the 
Apollonius circle jvzfj = \ then w, the reflection of z in the line 

\z — a\ = \z — b\, lies on the Apollonius circle = j. 

11. Let a and b be distinct complex numbers and 0 < a < ir. 

(i) Prove that each of the following sets in the complex plane rep- 
resents a circular arc and sketch the circular arcs on the same 
diagram: 
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Arg 


z — a 
z-b 


= a , 


—a, 7 t — a, a — it. 


Also show that Arg 

z — b 

a and b, while Arg - 

z — b 

the line through a and b. 


- n represents the line segment joining 
0 represents the remaining portion of 


(ii) Use (i) to prove that four distinct points z±, Z2, 23, 24 are con- 
cyclic or collinear, if and only if the cross-ratio 


24 ~ 21 
24 - 22 


, 23 ~ 2j 
7 23 - 2 2 


is real. 

(iii) Use (ii) to derive Ptolemy’s Theorem: Four distinct points A, B, C, D 
are concyclic or collinear, if and only if one of the following holds: 


AB ■ CD + BC ■ AD = AC ■ BD 
BD ■ AC + AD ■ BC = AB ■ CD 
BD ■ AC T AB ■ CD = AD- BC. 



Chapter 6 


EIGENVALUES AND 
EIGENVECTORS 


6.1 Motivation 

We motivate the chapter on eigenvalues by discussing the equation 

ax 2 + 2 hxy + by 2 = c, 

where not all of a, h, b are zero. The expression ax 2 + 2 hxy + by 2 is called 
a quadratic form in x and y and we have the identity 


ax 


r 1 

a 

h ' 


X 

[ x y \ 

h 

b 


. y . 


= X t AX, 


where X 
form. 


x 

y 


and A 


a h 
h b 


A is called the matrix of the quadratic 


We now rotate the x, y axes anticlockwise through 9 radians to new 
xi, y\ axes. The equations describing the rotation of axes are derived as 
follows: 

Let P have coordinates (x, y) relative to the x, y axes and coordinates 
(aq, yi) relative to the aq, y\ axes. Then referring to Figure 6.1: 
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Figure 6.1: Rotating the axes. 


x = OQ = OP cos {9 + a) 

= OP {cos 9 cos a — sin 9 sin a) 

= {OP cos a) cos 9 — {OP sin a) sin 9 
= OR cos 9 — PR sin 9 
= x'i cos 9 — y\ sin 9. 


Similarly y = x\ sin 9 + y\ cos 9. 

We can combine these transformation equations into the single matrix 
equation: 


X 


cos 9 

— sin 9 


X\ 

y . 


sin 9 

cos 9 


. 2/i . 


or X = PY , where X = 

X 

, Y = 

X\ 

and P = 

cos 9 — sin 9 


y 


yi 


sin 9 cos 9 

We note that the columns of P give the directions of the positive x\ and y\ 


axes. Also P is an orthogonal matrix - we have PP* = I 2 and so P 1 = P t . 
The matrix P has the special property that det P = 1. 


is called a rotation matrix. 


cos 9 — sin 9 
sin 9 cos 9 

We shall show soon that any 2x2 real orthogonal matrix with determinant 


A matrix of the type P = 
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equal to 1 is a rotation matrix. 

We can also solve for the new coordinates in terms of the old ones: 
xi 1 I" cos 6 sin# 1 I" r 

y i J [ — sine/ cos# j \_ U 

so x\ = x cos 9 + y sin 6 and y\ = — x sin 9 + y cos 9. Then 

X*AX = (PYYA(PY) = Y\P t AP)Y. 

Now suppose, as we later show, that it is possible to choose an angle 9 so 
that P f AP is a diagonal matrix, say diag(Ai, A 2 ). Then 

X t AX =lxi yi] \ ^ ,° 1 [ Xl 1 = \ixj + \ 2 yl (6.1) 

L u A2 J L 2/1 

and relative to the new axes, the equation ax 2 + 2 hxy + by 2 = c becomes 
Aixf + A 2 y\ = c, which is quite easy to sketch. This curve is symmetrical 
about the x\ and y\ axes, with Pi and P 2 , the respective columns of P, 
giving the directions of the axes of symmetry. 

Also it can be verified that Pi and P -2 satisfy the equations 

AP\ = A 1 P 1 and AP 2 = \ 2 P 2 - 


These equations force a restriction on Ai and A 2 . For if Pi 
first equation becomes 


a h u\ 

h b v\ 


Ai 


u 1 

Vl 


or 


a — Ai h 
h b- Ai 


u 1 
v\ 


u 1 
Vl 


the 



Hence we are dealing with a homogeneous system of two linear equations in 
two unknowns, having a non-trivial solution {u\, iq). Hence 

a — Ai h 
h b-Xi 

Similarly, A 2 satisfies the same equation. In expanded form, Ai and A 2 
satisfy 

A 2 — (a + b) A + ab — h 2 = 0. 

This equation has real roots 




(6.2) 


(The roots are distinct if a / b or h / 0. The case a = b and h = 0 needs 
no investigation, as it gives an equation of a circle.) 

The equation A 2 — (a + b)X + ab — h 2 = 0 is called the eigenvalue equation 
of the matrix A. 
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6.2 Definitions and examples 

DEFINITION 6.2.1 (Eigenvalue, eigenvector) 

Let A be a complex square matrix. Then if A is a complex number and 
X a non-zero complex column vector satisfying AX = XX, we call X an 
eigenvector of A, while A is called an eigenvalue of A. We also say that X 
is an eigenvector corresponding to the eigenvalue A. 


So in the above example Pi and P 2 are eigenvectors corresponding to Ai 
and A 2 , respectively. We shall give an algorithm which starts from the 


eigenvalues of A = 


a h 
h b 


and constructs a rotation matrix P such that 


P t AP is diagonal. 

As noted above, if A is an eigenvalue of an n x n matrix A, with 
corresponding eigenvector X, then (A — A I n )X = 0, with X / 0, so 
det (A — XI n ) = 0 and there are at most n distinct eigenvalues of A. 

Conversely if det (A — A I n ) = 0, then (A — A I n )X = 0 has a non-trivial 
solution X and so A is an eigenvalue of A with X a corresponding eigenvector. 


DEFINITION 6.2.2 (Characteristic equation, polynomial) 

The equation det (A — A I n ) = 0 is called the characteristic equation of A, 
while the polynomial det (A — A I n ) is called the characteristic polynomial of 
A. The characteristic polynomial of A is often denoted by ch 21 (A). 

Hence the eigenvalues of A are the roots of the characteristic polynomial 
of A. 


For a 2 x 2 matrix A = 


a b 
c d 


it is easily verified that the character- 


istic polynomial is A 2 — (trace A) A + det A, where trace A = a + d is the sum 
of the diagonal elements of A. 


EXAMPLE 6.2.1 Find the eigenvalues of A 
vectors. 


2 1 
1 2 


and find all eigen- 


Solution. The characteristic equation of A is A 2 — 4A + 3 = 0, or 


(A - 1)(A — 3) = 0. 


Hence A = 1 or 3. The eigenvector equation (A — A I n )X = 0 reduces to 


2 - A 
1 



X 


' 0 ' 


. y . 


0 
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or 


(2 - X)x + y = 0 
x + (2 - A )y = 0. 

Taking A = 1 gives 

x + y = 0 
x + y = 0, 


which has solution x = —y, y arbitrary. Consequently the eigenvectors 


corresponding to A = 1 are the vectors 
Taking A = 3 gives 


-y 

y 


, with y / 0. 


— x + y = 0 

x-y = 0, 


which has solution x = y, y arbitrary. Consequently the eigenvectors corre- 


sponding to A = 3 are the vectors 


y 

y 


, with y / 0. 


Our next result has wide applicability: 


THEOREM 6.2.1 Let A be a 2 x 2 matrix having distinct eigenvalues Ai 
and A 2 and corresponding eigenvectors X\ and X- 2 - Let P be the matrix 
whose columns are X\ and X 2 , respectively. Then P is non-singular and 


P _1 AP = 


Ai 0 
0 A 2 


Proof. Suppose AX^ = X\X\ and AX 2 = A 2 ^ 2 - We show that the system 
of homogeneous equations 


xX\ + yX 2 = 0 

has only the trivial solution. Then by theorem 2.5.10 the matrix P = 
[X 1 IX 2 ] is non-singular. So assume 

xX\ + yX 2 = 0. (6.3) 

Then A(xX 1 + yX 2 ) = A0 = 0, so x(AX\) + y(AX 2 ) = 0. Hence 


x\\Xi + t/A2^2 — 0. 


(6.4) 
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Multiplying equation 6.3 by Ai and subtracting from equation 6.4 gives 

(A 2 — X 1 )yX 2 = 0. 

Hence y = 0, as (A 2 — Ai) / 0 and X 2 / 0. Then from equation 6.3, xX\ = 0 
and hence x = 0. 

Then the equations AX \ = A 1 X 1 and AX 2 = A 2 X 2 give 


AP = A[Xi\X 2 ) = [AX 1 \AX 2 \ = [XiXi\X 2 X 2 ] 

= [Xi|X 2 ] 


Ai 

0 

= p 

Ai 

0 

0 

A 2 


0 

A 2 


so 


P _1 AP = 


Ai 0 
0 A 2 


EXAMPLE 6.2.2 Let A = 


2 1 
1 2 


X\ = 


-1 

1 


and X 2 = 


1 and 3, respectively. Hence if P = 

P~ 1 AP = 


be the matrix of example 6.2.1. Then 
are eigenvectors corresponding to eigenvalues 
we have 


-1 1 

1 1 


1 0 
0 3 


There are two immediate applications of theorem 6.2.1. The first is to the 
calculation of A n : If P~ 1 AP =diag(Ai, A 2 ), then A = Pdiag(Ai, A 2 )P _1 
and 


A n 



Ai 0 
0 A 2 



Ai 

0 


0 

X2 


n 

p- 1 = p 


xi 0 

0 XI 


p- 1 . 


The second application is to solving a system of linear differential equations 


where A = 


a b 
c d 


dx 

- = a x + by 

d y , a 

— = cx + dy, 


is a matrix of real or complex numbers and x and y 


are functions of t. The system can be written in matrix form as X = AX, 
where 


X = 


x 

y 


and X = 


x 

y 


r dx 
dt 
dy 

L dt J 
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We make the substitution X = PY, where Y 
are also functions of t and 


Xl 

Vi 


Then x\ and y\ 


X = PY = AX = A(PY), so Y = ( P~ 1 AP)Y 


Ai 0 

0 A 2 


Hence x\ = Aiaq and y\ = A 22/1 - 

These differential equations are well-known to have the solutions x\ = 
aq(0)e Al * and X 2 = .T 2 ( 0 )e A2 *, where aq(0) is the value of x\ when t = 0. 

[If jjj = kx, where k is a constant, then 

4- ( e~ kt x ) = -ke~ kt x + e~ ktd 4 = ~ke~ kt x + e~ kt kx = 0. 
at \ ) at 


Hence e kt x is constant, so e kt x = e fco x(0) = x(0). Hence x = x(0)e kt .] 


However 


21 ( 0 ) 

— P~ 1 

' z(0) ' 

_ 2 / 1 ( 0 ) _ 

1 

. 2/(0) 


so this determines aq(0) and y\ (0) in 


terms of x(0) and y{ 0). Hence ultimately x and y are determined as explicit 
functions of t, using the equation X = PY. 


EXAMPLE 6.2.3 Let A 


2 -3 
4 -5 


derive an explicit formula for A n and a 
equations 


. Use the eigenvalue method to 
.so solve the system of differential 


dx 

dt 

dy 

dt 


2x — 3 y 
Ax — 5 y, 


given x = 7 and y = 13 when t = 0. 

Solution. The characteristic polynomial of A is A 2 +3A+2 which has distinct 
roots Ai = — 1 and A 2 = —2. We find corresponding eigenvectors X\ = ^ 


and X 2 
Hence 


3 

4 


Hence if P 


1 3 
1 4 


we have P l AP = diag(— 1, —2). 


A n = (Pdiag(— 1, — 2)P“ 1 ) n = Pdiag((-l) n , (~2) n )P~ 1 


1 3 ' 

' (-1)" 

1 

O 

4 -3 ' 

1 4 

0 

(-2)" J 

-1 1 
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(-1)" 

' 1 3 ' 

' 1 

0 

4 

1 4 

0 

2 n 

-1 

(-l) n 

' 1 3x2"' 

4 

-3 ' 

1 4x2" 

-1 

1 

(-l) n 

'4-3x2" 

4-4x2" 

-3 + 3x2" 
-3 + 4x2" 


-3 

1 


To solve the differential equation system, make the substitution X = 
PY. Then x = x\ + 3yi, y = x i + 4yi . The system then becomes 


X\ = —X\ 

2/1 = —2z/i, 

so x’i = xi(0)e - *, y± = yi(0)e~’ 2t . Now 


a+(0) 

= P- 1 

' s(0) ' 


4 -3 ' 

7 


' -11 ' 

_ 2/1 (0) _ 

. 2/(0) _ 


-1 1 

13 


6 


so x\ = — lie - * and y\ = 6e -2 *. Hence x = — lie - * + 3(6e -2 *) = —lie - * + 
18e -2 *, y = —lie - * + 4(6e -2 *) = —lie - * + 24e -2 *. 

For a more complicated example we solve a system of inhomogeneous 
recurrence relations. 

EXAMPLE 6.2.4 Solve the system of recurrence relations 

%n + 1 — — X n 2/n 1 

'!Jn+ 1 — %n T 2 y n -f- 2, 

given that xo = 0 and yo = — 1. 

Solution. The system can be written in matrix form as 

Xn+1 = AX n + B , 

where 

It is then an easy induction to prove that 


and B = 


-1 

2 


= A n X 0 + (A n-1 H h A + I 2 )B. 


(6.5) 
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which simplifies to 

' x n 1 = T (2n + 1 — 3 n )/4 ' 

. y n J |_ ( 2 n — 5 + 3 n )/4 

Hence x n = (2 n — 1 + 3”)/4 and y n = (2 n — 5 + 3 n )/4. 

REMARK 6.2.1 If ( A — I- 2 )~ l existed (that is, if det (A — / 2 ) 7 ^ 0, or 

equivalently, if 1 is not an eigenvalue of A), then we could have used the 

formula 

A n_1 + • • • + A + I 2 = (A n - I 2 )(A - /a)" 1 . (6.6) 

However the eigenvalues of A are 1 and 3 in the above problem, so formula 6.6 
cannot be used there. 


Our discussion of eigenvalues and eigenvectors has been limited to 2 x 2 
matrices. The discussion is a more complicated for matrices of size greater 
than two and is best left to a second course in linear algebra. Nevertheless 
the following result is a useful generalization of theorem 6.2.1. The reader 
is referred to [28, page 350] for a proof. 

THEOREM 6.2.2 Let A be an n x n matrix having distinct eigenvalues 
Ai, . . . , A n and corresponding eigenvectors X\, . . . , X n . Let P be the matrix 
whose columns are respectively X{ , . . . , X n . Then P is non-singular and 

[ Ai 0 ••• O' 

0 A 2 • • • 0 

0 0 ■ ■ ■ \ n 


P~ X AP 
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Another useful result which covers the case where there are multiple eigen- 
values is the following (The reader is referred to [28, pages 351-352] for a 
proof): 

THEOREM 6.2.3 Suppose the characteristic polynomial of A has the fac- 
torization 

det (A I n - A) = (A - cif 1 • • • (A - c t ) n \ 

where ci,...,q are the distinct eigenvalues of A. Suppose that for i = 

1 , , . . , t, we have nullity (cj/ n — A) = n,. For each i , choose a basis Xu , . . . , AQ ni 
for the eigenspace N(cil n — A). Then the matrix 


P=[X u\---\X lni \---\X tl \---\X tnt } 


is non-singular and P 1 AP is the following diagonal matrix 


P _1 AP 


^ 1 1 n i 0 

0 C2 1 ii2 


0 

0 


0 0 • • • c t I nt 


(The notation means that on the diagonal there are ni elements ci, followed 
by n 2 elements C 2 ,. . . , n* elements Cf.) 


6.3 PROBLEMS 


1. Let A = 


4 -3 
1 0 

diag(l, 3) and hence prove that 


Find a non-singular matrix P such that P l AP = 


An 3 n — 1 3 — 3 n 

A = "4 + — — h- 


2. If A 


0.6 0.8 

0.4 0.2 


prove that A n tends to a limiting matrix 


2/3 2/3 
1/3 1/3 


as n 


oo. 
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3. Solve the system of differential equations 


d i - 

f = 5s-4 Vl 

given x = 13 and y = 22 when t = 0. 

[Answer: x = 7e* + 6e~ 2t , y = 7e t + 15e~ 2 *.] 

4. Solve the system of recurrence relations 

%n + 1 — 3x n y n 

Vn + 1 — A 3 y n , 

given that x'o = 1 and yo = 2. 

[Answer: x n = 2 n_1 (3 — 2 n ), y n = 2 n_1 (3 + 2 n ).] 


5. Let A = 


a b 
c d 


be a real or complex matrix with distinct eigenvalues 
Ai, A 2 and corresponding eigenvectors X 2 . Also let P = [X\ \X 2 ], 
(a) Prove that the system of recurrence relations 

2-n+i — X by n 

y n +i = cx n + dy n 


has the solution 


x n 

Vn 


= aX'iXi + (3X%X 2 , 


where a and (3 are determined by the equation 


a 

P 


= P 


-1 


x 0 

yo 


(b) Prove that the system of differential equations 

dx , 

— = ax + by 
dt 

d y , a 

— = cx + dy 


has the solution 


x 

y 


= ae Xlt Xi +f3e X2t X 2 , 
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where a and j3 are determined by the equation 


a 

0 


= P~ l 


x(0) 

2 /( 0 ) 


6. Let A = 


be a real matrix with non-real eigenvalues A = 


Oil «12 
«21 022 

a + ib and A = a — ib, with corresponding eigenvectors X = U + iV 
and X = U — iV, where U and V are real vectors. Also let P be the 
real matrix defined by P = [/7|V]. Finally let a + ib = re , where 
r > 0 and 6 is real. 


(a) Prove that 


AU = aU-bV 
AV = bU + aV. 


(b) Deduce that 


P~ 1 AP 


a —b 
b a 


(c) Prove that the system of recurrence relations 


%n + 1 — ailX n 012 Vn 

Un + 1 = 02 lX n + a22Un 


has the solution 


Xn 

Vn 


= r n {(aU + f3V) cos n6 + (f3U 


aV) sin nd}, 


where a and (3 are determined by the equation 


a 

0 


= P 


-l 


Xq 

yo 


(d) Prove that the system of differential equations 

dx , 

— = ax + by 

dt 


dy 

dt 


cx + dy 
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has the solution 


= e at {(aU + PV) cos bt + (/3U — aV) sin bt}, 


where a and /3 are determined by the equation 


[Hint: Let 


and deduce that 


Also let z = x\ + iy\. Prove that 


z = (a — ib)z 


x\ + iyi = e at (a + i/3) (cos bt + isin bt). 

Then equate real and imaginary parts to solve for x\, yi and 
hence x, y] 

7. (The case of repeated eigenvalues.) Let A = a ^ and suppose 

that the characteristic polynomial of A, A 2 — (a + d)X + (ad — be), has 
a repeated root a. Also assume that A / a/ 2 . Let B = A — a/ 2 . 

(i) Prove that (a — d) 2 + 4bc = 0. 

(ii) Prove that B 2 = 0. 

(iii) Prove that BX 2 / 0 for some vector X% indeed, show that X 2 
can be taken to be J or ^ 

(iv) Let X\ = BX 2 . Prove that P = [A 1 IA 2 ] is non-singular, 


AX\ — olX\ and AX 2 — 0 .X 2 T X\ 


and deduce that 


P~ l AP = 


8. Use the previous result to solve system of the differential equations 

dx 

Tt = ix ~ v 


4x + 8 y 
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given that x = 1 = y when t = 0. 

[To solve the differential equation 

dx , 

— kx = ft, k a constant, 

dt 

multiply throughout by e~ kt , thereby converting the left-hand side to 
[Answer: x = (1 — 3f)e w , y = (1 + 6 f)e 6t .] 


9. Let 


A = 


1/2 1/2 0 
1/4 1/4 1/2 
1/4 1/4 1/2 


(a) Verify that det (A/3 — A), the characteristic polynomial of A, is 
given by 

(A — 1)A(A — |). 


(b) Find a non-singular matrix P such that P l AP = diag(l, 0, |). 

(c) Prove that 


A n = 


1 

3 


1 1 1 
1 1 1 
1 1 1 


1 

+ 3 • 4 n 


2 2-4 

-1 -1 2 
-1 -1 2 


if n > 1. 


10. Let 



-2 

-2 

5 


(a) Verify that det (A/3 — A), the characteristic polynomial of A, is 
given by 

(A — 3) 2 (A — 9). 

(b) Find a non-singular matrix P such that P~ 1 AP = diag (3, 3, 9). 



Chapter 7 

Identifying second degree 
equations 


7.1 The eigenvalue method 

In this section we apply eigenvalue methods to determine the geometrical 
nature of the second degree equation 

ax 2 + 2 hxy + by 2 + 2 gx + 2 fy + c = 0, (7.1) 


where not all of a, h, b are zero. 
a h 
h b 


Let A = 


be the matrix of the quadratic form ax 2 + 2 hxy + by 2 . 


We saw in section 6.1, equation 6.2 that A has real eigenvalues Ai and A 2 , 
given by 


a + b — y/(a — b) 2 + Ah 2 a + b + y/(a — b) 2 + 4 h 2 

Ai = g ’ A2 = 2 • 

We show that it is always possible to rotate the x, y axes to x±, X 2 axes whose 
positive directions are determined by eigenvectors X\ and X 2 corresponding 
to Aiand A 2 in such a way that relative to the x\, y\ axes, equation 7.1 takes 
the form 

a'x 2 + b'y 2 + 2g'x + 2f'y + c = 0. (7.2) 

Then by completing the square and suitably translating the aq, y\ axes, 
to new X 2 , 2/2 axes, equation 7.2 can be reduced to one of several standard 
forms, each of which is easy to sketch. We need some preliminary definitions. 
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DEFINITION 7.1.1 (Orthogonal matrix) An n x n real matrix P is 
called orthogonal if 

P l P = In- 


It follows that if P is orthogonal, then det P = ±1. For 

det {P t P) = det P l det P = ( det P) 2 , 

so (det l 3 ) 2 =det I n = 1. Hence det P = ±1. 

If P is an orthogonal matrix with det P = 1 , then P is called a proper 
orthogonal matrix. 


THEOREM 7.1.1 If P is a 2 x 2 orthogonal matrix with detP = 1, then 


P = 


cos 9 — sin 9 
sin 9 cos 9 


for some 9. 


REMARK 7.1.1 Hence, by the discusssion at the beginning of Chapter 
6, if P is a proper orthogonal matrix, the coordinate transformation 


x 

V 



represents a rotation of the axes, with new x\ and y\ axes given by the 
repective columns of P. 


Proof. Suppose that P t P = I 2 , where A =detP = 1. Let 


P = 


a b 
c d 


Then the equation 

P f = P- 1 = adj P 
A 


gives 


a 

c 


d 

-b ' 

b 

d 


—c 

a 


Hence a = d, b = —c and so 


P = 


a 

c 


—c 

a 


where a 2 + c 2 = 1. But then the point (a, c) lies on the unit circle, so 
a = cos 9 and c = sin0, where 9 is uniquely determined up to multiples of 


2t r. 
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DEFINITION 7.1.2 (Dot product). If X = 

X ■ Y, the dot product of X and Y. is defined by 

X -Y = ac + bd. 

The dot product has the following properties: 

(i) X ■ (Y + Z) = X ■ Y + X ■ Z- 

(ii) X ■ Y = Y ■ X; 

(hi) (tX) ■ Y = t(X ■ y); 


and Y = 


(iv) X ■ X = a 2 + b 2 if X = 


(v) X-Y = X f Y. 

The length of X is defined by 

\\X\\ = Va 2 + b 2 = (. X-X ) 1/2 . 

We see that ||X|| is the distance between the origin O = (0, 0) and the point 
(a, b). 

THEOREM 7.1.2 (Geometrical interpretation of the dot product) 

Let A = (x\, yi) and B = (x' 2 , 1 ) 2 ) be points, each distinct from the origin 


o = (0, 0). Then if X = 


and Y = 


X ■ Y = OA ■ OB cos 9, 


we have 


where 9 is the angle between the rays OA and OB. 

Proof. By the cosine law applied to triangle OAB , we have 

AB 2 = OA 2 + OB 2 - 20A ■ OB cos 9. 

Now AB 2 = (x 2 - xi) 2 + (y 2 - y±) 2 , OA 2 = x 2 + y \ , OB 2 = x 2 + y 2 . 
Substituting in equation 7.3 then gives 


(x 2 - X1) 2 + ( 2/2 - ViY = {x\ + y\) + (X2 + yl) - 204 • OBcos9 , 
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which simplifies to give 

OA ■ OB cos 9 = x\X 2 + 2/12/2 = X ■ Y. 

It follows from theorem 7.1.2 that if A = (xi, 2/1) and B = (a; 2 , 1 ) 2 ) are 

points distinct from O = (0, 0) and X = 

X Y = 0 means that the rays OA and OB are perpendicular. This is the 
reason for the following definition: 

DEFINITION 7.1.3 (Orthogonal vectors) Vectors X and Y are called 
orthogonal if 

XY = 0 . 


and Y = 


X2 

m 


then 


There is also a connection with orthogonal matrices: 


THEOREM 7.1.3 Let P be a 2 x 2 real matrix. Then P is an orthogonal 
matrix if and only if the columns of P are orthogonal and have unit length. 


Proof. P is orthogonal if and only if P*P = / 2 . Now if P = [Xi|X 2 ], the 
matrix P f P is an important matrix called the Gram matrix of the column 
vectors X\ and X 2 . It is easy to prove that 


P t P=[X l -X j ] 


X 1 -X 1 X 1 -X 2 

v 2 • V, x 2 -x 2 


Hence the equation P t P = / 2 is equivalent to 


' x 1 ■ X 1 X\ ■ x 2 


' 10 ' 

_ X 2 ■ Xy X 2 ■ x 2 


0 1 


or, equating corresponding elements of both sides: 


X 1 -X 1 = 1, X 1 • A 2 = 0, A 2 • A 2 = 1, 

which says that the columns of P are orthogonal and of unit length. 

The next theorem describes a fundamental property of real symmetric 
matrices and the proof generalizes to symmetric matrices of any size. 

THEOREM 7.1.4 If X\ and X 2 are eigenvectors corresponding to distinct 
eigenvalues Ai and A 2 of a real symmetric matrix A, then X± and X 2 are 
orthogonal vectors. 
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Proof. Suppose 

AX i = AiXi, AX 2 = X 2 X 2 , (7.4) 

where X\ and X 2 are non-zero column vectors, A 1 = A and Ai ^ \ 2 . 

We have to prove that X\X 2 = 0. From equation 7.4, 


X\ AX 1 = X\X\X\ 

(7.5) 

x[ax 2 = X 2 X[X 2 . 

(7.6) 


From equation 7.5, taking transposes, 

(XZAXtf = {XiXlxrf 

so 

X\A t X 2 = Ai X\X 2 . 

Hence 

X\AX 2 = A!X(X 2 . (7.7) 

Finally, subtracting equation 7.6 from equation 7.7, we have 

(Ai - X 2 )XiX 2 = 0 


and hence, since Ai / A 2 , 

X\X 2 = 0. 

THEOREM 7.1.5 Let A be a real 2x2 symmetric matrix with distinct 
eigenvalues Ai and A 2 . Then a proper orthogonal 2x2 matrix P exists such 
that 

P t AP= diag (Ai, A 2 ). 

Also the rotation of axes 


x 

V 



“diagonalizes” the quadratic form corresponding to A: 


X*AX = X lX f + A 2 yl 
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Proof. Let X\ and X2 be eigenvectors corresponding to Ai and A 2 . Then 
by theorem 7.1.4, X\ and X2 are orthogonal. By dividing X\ and X2 by 
their lengths (i.e. normalizing X\ and X2) if necessary, we can assume that 
X\ and X2 have unit length. Then by theorem 7.1.1, P = [X 1 IX 2 ] is an 
orthogonal matrix. By replacing X\ by —X\ , if necessary, we can assume 
that detP = 1. Then by theorem 6.2.1, we have 


P t AP = P~ l AP 


Ai 0 
0 A 2 


Also under the rotation X = PY, 

X f AX = (. PY) t A(FY ) = Y t (P t AP)Y = y*diag(Ai, A 2 )Y 
= Xix\ + \ 2 y\. 


EXAMPLE 7 . 1.1 Let A be the symmetric matrix 

, T 12 -6 1 

[-6 rj- 

Find a proper orthogonal matrix P such that P l AP is diagonal. 
Solution. The characteristic equation of A is A 2 — 19A + 48 = 0, or 

(A - 16)(A — 3) = 0. 


Hence A has distinct eigenvalues Ai = 16 and A 2 = 3. We find corresponding 
eigenvectors 


-3 

2 


Xi = 

Now ||Xi|| = 1 1 JX 2 1 1 = \/l3. So we take 

*i = 4 


and X 2 = 


' -3 ' 

and X-> = — — = 

' 2 ' 

2 

~ v/13 

3 


Then if P = [X 1 IX 2 ], the proof of theorem 7.1.5 shows that 


P f AP 


16 0 
0 3 


However det P 


1, so replacing X\ by —X\ will give detP = 1. 
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Figure 7.1: 12x 2 — 12 xy + 7 y 2 + 60x — 38 y + 31 = 0. 

REMARK 7.1.2 (A shortcut) Once we have determined one eigenvec- 
tor X\ = . the other can be taken to be , as these these vectors 

b J [a 

are always orthogonal. Also P = [Ai| X?\ will have det P = a 2 + b 2 >0. 

We now apply the above ideas to determine the geometric nature of 
second degree equations in x and y. 

EXAMPLE 7.1.2 Sketch the curve determined by the equation 

12x 2 — 12 xy + 7 y 2 + 60x — 38 y + 31 = 0. 

Solution. With P taken to be the proper orthogonal matrix defined in the 
previous example by 

= r 3/ \/l3 2/v/l3 ' 

[ —2/^13 3/v/l3 J ’ 

then as theorem 7.1.1 predicts, P is a rotation matrix and the transformation 


X = 


x 

y 


= py = p 


Xi 

yi 
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or more explicitly 


3xi + 2 2/1 -2xi + 3 //i 

x = — , y = 


\/l3 


\/l3 


(7.8) 


will rotate the x, y axes to positions given by the respective columns of P. 
(More generally, we can always arrange for the xi axis to point either into 
the first or fourth quadrant.) 

is the matrix of the quadratic form 


Now A = 


-6 7 


12x 2 — 12xy + 7 y 2 , 


so we have, by Theorem 7.1.5 

12x 2 — I2xy + 7 y 2 = 16x 2 + 3y 2 . 

Then under the rotation X = PY, our original quadratic equation becomes 


60 


38 


16xj + 3 y{ H — /=(3xi + 2yi) =(-2xi + 3//i) + 31 = 0, 


\/l3 


\/l3 


or 


o 9 256 6 

16x1 + 3yi + ~^^ Xl + + 31 — 0. 


Now complete the square in xi and y \ : 


16 ( x 2 + -^=xi^ + 3 (^yl + -^= 2/1 ) + 31 — 0, 


16 xi + 


\/l3 


+ 3 yi + 


v/13 


= 16 


= 48. 


a/13 


+ 3 


\/l3 


- 31 


(7.9) 


Then if we perform a translation of axes to the new origin (xi, 2 / 1 ) = 

( 8 ]_). 

1 \Zl3’ vTN' 

8 1 
x 2 = xi + -=, 2/2 = yi + 


equation 7.9 reduces to 


v/l3’ yz " ' y/lT 
16x 2 + 32/| = 48, 


A , A 

3 16 


or 
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This equation is now in one of the standard forms listed below as Figure 7.2 
and is that of a whose centre is at (® 2 , IJ 2 ) = (0, 0) and whose axes of 
symmetry lie along the X 2 , y 2 axes. In terms of the original x, y coordinates, 
we find that the centre is (x, y) = (—2, 1). Also Y = so equations 7.8 
can be solved to give 

3aq - 2y, 2aq + 3y\ 

Xl = yi = ~vnr- 

Hence the y 2 _ax is is given by 


3.x — 2 y 8 

““ + ^ 13 ’ 

or 3x — 2y + 8 = 0. Similarly the X 2 axis is given by 2x + 3y + 1 = 0. 

This ellipse is sketched in Figure 7.1. 

Figures 7.2, 7.3, 7.4 and 7.5 are a collection of standard second degree 
equations: Figure 7.2 is an ellipse; Figures 7.3 are hyperbolas (in both these 

examples, the asymptotes are the lines y = ±-x); Figures 7.4 and 7.5 

a 

represent parabolas. 



EXAMPLE 7.1.3 Sketch y 2 — 4x — 10 y — 7 = 0. 
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Solution. Complete the square: 

y 2 - lOy + 25 - 4x - 32 = 0 

(y — 5) 2 = 4x + 32 = 4(® + 8), 

or y\ = 4x'i, under the translation of axes aq = x + 8, y\ = y — 5. Hence we 
get a parabola with vertex at the new origin (aq, y\ ) = (0, 0), i.e. (x, y) = 
(-8, 5). 

The parabola is sketched in Figure 7.6. 

EXAMPLE 7.1.4 Sketch the curve x 2 — Axy + 4 y 2 + 5y — 9 = 0. 
Solution. We have x 2 — 4 xy + 4 y 2 = X t AX, where 



The characteristic equation of A is A 2 — 5A = 0, so A has distinct eigenvalues 
Ai = 5 and A 2 = 0. We find corresponding unit length eigenvectors 






Then P = [W |W)] is a proper orthogonal matrix and under the rotation of 
axes X = PY, or 


x 

y 


x\ + 2 yi 
" 

-2.T1 + yi 

V5 
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Figure 7.6: y 2 — 4x — 10 y — 7 = 0. 


we have 

x 2 - 4 xy + 4 y 2 = Aixf + A 22 /f = 5xf. 
The original quadratic equation becomes 


5xf + 
5(xf - 
5(xi - 


V5 

y/h 


(-2xi +yi) - 9 


-^=xi) + \/%i - 9 
^ )2 = 10 -^ 


0 

0 

v /%1 - 2v/5), 


or 5x 2 = ~^y 2 , where the xi, y\ axes have been translated to X 2 , y 2 axes 
using the transformation 


X2 = xi 7=, 2/2 = 2/1 — 2\/5. 

v 5 


Hence the vertex of the parabola is at (x 2 , y 2 ) = (0, 0), i.e. (xi, y\) = 
2v/5), or (x, y) = |). The axis of symmetry of the parabola is the 

line X 2 = 0, i.e. xi = l/Vh. Using the rotation equations in the form 

x — 2 y 


Xl 
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Figure 7.7: x 2 — 4 xy + 4 y 2 + by — 9 = 0. 


2 x + y 


we have 


x — 2 y 1 


or x — 2y = 1. 


The parabola is sketched in Figure 7.7. 


7.2 A classification algorithm 

There are several possible degenerate cases that can arise from the general 
second degree equation. For example x 2 + y 2 = 0 represents the point (0, 0); 
x 2 + y 2 = —1 defines the empty set, as does x 2 = —1 or y 2 = —1; x 2 = 0 
defines the line x = 0; ( x + y) 2 = 0 defines the line x + y = 0; x 2 — y 2 = 0 
defines the lines x — y = 0, x + y = 0] x 2 = 1 defines the parallel lines 
x = ±1] (x + y) 2 = 1 likewise defines two parallel lines x + y = ±1. 

We state without proof a complete classification 1 of the various cases 


x This classification forms the basis of a computer program which was used to produce 
the diagrams in this chapter. I am grateful to Peter Adams for his programming assistance. 
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that can possibly arise for the general second degree equation 
ax 2 + 2 hxy + by 2 + 2 gx + 2 fy + c = 0. 


(7.10) 


It turns out to be more convenient to first perform a suitable translation of 
axes, before rotating the axes. Let 


A = 


If C + 0, let 


aha 
h b f 
9 f c 


C = ab — h 2 , A = be — f 2 , B = ca — g 2 


a = 


9 h 
f b 


C 


f3 = 


a g 

h f 


C 


(7.11) 


CASE 1. A = 0. 


(1.1) C/ 0. Translate axes to the new origin (a, j3), where a and (3 are 
given by equations 7.11: 


x = xi+a, y = yi+/3. 


Then equation 7.10 reduces to 

ax\ + 2hx\y\ + byf = 0. 

(a) C > 0: Single point (x, y) = (a, /?). 

(b) C < 0: Two non— parallel lines intersecting in (x, y) = (a, /3). 
The lines are 

y- (3 

x — a 
x = a 

(1.2) C = 0. 

(a) h = 0. 

(i) a = g = 0. 

(A) A > 0: Empty set. 

(B) A = 0: Single line y = —f/b. 


—h ± \f — ( 


if 6^0, 


and “A if 6 = 0. 

x — a 2 h 
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(C) A < 0: Two parallel lines 


(ii) b = f = 0. 

(A) B > 0 

(B) B = 0 

(C) B < 0 


— / ± '/-A 


y = 


Empty set. 

Single line x = —g/a. 
Two parallel lines 


x = 


-g± \f^B 


(b) h + 0. 

(i) B > 0 

(ii) B = 0 

(iii) B < 0 


Empty set. 

Single line ax + hy = —g. 

Two parallel lines 


ax 


+ hy = -g± \rH3. 


CASE 2. A/0. 

(2.1) C / 0. Translate axes to the new origin (a, /3), where a 
given by equations 7.11: 

x = xi+a, y = yi+ (3. 


Equation 7.10 becomes 

axj + 2hxiyi + byj = - — . 

CASE 2.1(i) h = 0. Equation 7.12 becomes axf + byf = 

(a) C < 0: Hyperbola. 

(b) C > 0 and aA > 0: Empty set. 

(c) C > 0 and aA < 0. 

(i) a = b: Circle, centre (a, /?), radius \] 

(ii) a / b: Ellipse. 


and (3 are 


(7.12) 

A 
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Translate axes to (x±, y\ ) axes: 


2 

Vl = ~T Xl - 

(ii) 6 = 0: Then a / 0 and / / 0. Parabola with vertex 


. 


_ 9 _ 

a ’ 2/a / 

Translate axes to (x \ , yi ) axes: 


2 2 / 

= — y i- 

a 


(b) h / 0: Parabola. Let 


A , = g« + bf 
a + b 

The vertex of the parabola is 

/ (2 akf — hk 2 — hac ) a(T 2 + ac — 2kg) 

V d ’ d 

Now translate to the vertex as the new origin, then rotate to 
(x 2 , 2/2) axes with the positive X 2 ~axis along [sa, —sh], where 
s = sign (a). 

(The positive X 2 -axis points into the first or fourth quadrant.) 
Then the parabola has equation 

2 —2 st 

" 2 = 7^T7? y2 ’ 

where t = (af — gh) /(a + b). 

REMARK 7.2.1 If A = 0, it is not necessary to rotate the axes. Instead 
it is always possible to translate the axes suitably so that the coefficients of 
the terms of the first degree vanish. 

EXAMPLE 7.2.1 Identify the curve 

2x 2 + xy — y 2 + 6y — 8 = 0. 



(7.14) 
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Solution. Here 

2 ± 0 

A= \ -1 3 =0. 

6 3-8 

Let x = x\ + a, y = yi + (3 and substitute in equation 7.14 to get 

2(xi + a) 2 + (xi + a)(yi + (3) - (yi + (3) 2 + 4(yi + f3) - 8 = 0. (7.15) 

Then equating the coefficients of x\ and y\ to 0 gives 

4a + (3 = 0 
a + 2/3 + 4 = 0, 

which has the unique solution a = — |, (3 = |. Then equation 7.15 simplifies 
to 

2x\ + x\y\ — 2 /i =0 = (2xi ~ yi){xi + yi), 

so relative to the xi, y\ coordinates, equation 7.14 describes two lines: 2xq — 
yi = 0 or x\ + y\ = 0. In terms of the original x, y coordinates, these lines 
become 2(x+ |) — (y — |) = 0 and (x + |) + (y — |) = 0, i.e. 2x — y + 4 = 0 
and x + y — 2 = 0, which intersect in the point 

(x, y) = (a, (3) = (-^, |). 

EXAMPLE 7.2.2 Identify the curve 

x 2 + 2 xy + y 2 + +2x + 2y + 1 = 0. (7.16) 

Solution. Here 

1 1 1 

A = 1 1 1 = 0. 

1 1 1 

Let x = xi + a, y = yi + (3 and substitute in equation 7.16 to get 
(xi + a) 2 +2(xi +o) (yi +/3) + (yi +/3) 2 + 2(xi + o) +2(yi +/3) + 1 = 0. (7.17) 
Then equating the coefficients of xi and yi to 0 gives the same equation 

2a + 2/3 + 2 = 0. 

Take a = 0, (3 = — 1. Then equation 7.17 simplifies to 
x\ + 2xiyi + y 2 = 0 = (xi + yi) 2 , 
and in terms of x, y coordinates, equation 7.16 becomes 
(x + y + l) 2 = 0, or x + y + 1 = 0. 
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7.3 PROBLEMS 

1. Sketch the curves 

(i) x 2 — 8x + 8y + 8 = 0; 

(ii) y 2 — 12x + 2 y + 25 = 0. 

2. Sketch the hyperbola 

4 xy — 3y 2 = 8 
and find the equations of the asymptotes. 

[Answer: y = 0 and y = |x.] 

3. Sketch the ellipse 

8.x 2 — 4 xy + 5 y 2 = 36 

and find the equations of the axes of symmetry. 

[Answer: y = 2x and x = —2 y] 

4. Sketch the conics defined by the following equations. Find the centre 
when the conic is an ellipse or hyperbola, asymptotes if an hyperbola, 
the vertex and axis of symmetry if a parabola: 

(i) 4x 2 — 9 y 2 — 24x — 36 y — 36 = 0; 

(ii) 5x 2 — 4 xy + 8 y 2 + 4\/5x — 16\/5 y + 4 = 0; 

(iii) 4x 2 + y 2 — 4 xy — 10 y — 19 = 0; 

(iv) 77 x 2 + 78 xy — 27 y 2 + 70x — 30y + 29 = 0. 

[Answers: (i) hyperbola, centre (3, —2), asymptotes 2x — 3y — 12 = 
0, 2x + 3y = 0; 

(ii) ellipse, centre (0, v^5); 

(iii) parabola, vertex (— |, — |), axis of symmetry 2x — y + 1 = 0; 

(iv) hyperbola, centre (— ^, ^), asymptotes 7x + 9y + 7 = 0 and 
llx -3y-l = 0.] 

5. Identify the lines determined by the equations: 

(i) 2x 2 + y 2 + 3 xy — 5x — 4y + 3 = 0; 
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(ii) 9x 2 + y 2 — 6 xy + 6x — 2y + 1 = 0; 

(iii) x 2 + 4 xy + Ay 2 — x — 2y — 2 = 0. 

[Answers: (i) 2x + y — 3 = 0 and x + y — 1 = 
(iii) x + 2y + 1 = 0 and x + 2y — 2 = 0.] 


0; (ii) 2>x - y + 1 = 0; 



Chapter 8 


THREE-DIMENSIONAL 

GEOMETRY 


8.1 Introduction 

In this chapter we present a vector-algebra approach to three-dimensional 
geometry. The aim is to present standard properties of lines and planes, 
with minimum use of complicated three-dimensional diagrams such as those 
involving similar triangles. We summarize the chapter: 

Points are defined as ordered triples of real numbers and the distance 
between points P\ = (xi, yi, z\ ) and P-2 = (x2, V2, 22) is defined by the 
formula 

P1P2 = V ( x 2 ~ xi ) 2 + (2/2 - Vi ) 2 + O2 - zi ) 2 . 

Directed line segments AB are introduced as three-dimensional column 
vectors: If A = (xi, y\, z\) and B = (x2, 2/2, Z2), then 

x 2 - Xl 
m - 2/1 • 

■ 2-2 Z\ 

If P is a point, we let P =OP and call P the position vector of P. 

With suitable definitions of lines, parallel lines , there are important ge- 
ometrical interpretations of equality, addition and scalar multiplication of 
vectors. 

(i) Equality of vectors: Suppose A, B, C , D are distinct points such that 
no three are collinear. Then AB=CD if and only if AB || CD and 
AC || BD (See Figure 8 . 1 .) 
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z 



Figure 8.1: Equality and addition of vectors. 


(ii) Addition of vectors obeys the parallelogram law: Let A, B, C be non- 
collinear. Then 

AH + AC=AD, 

where D is the point such that AB || CD and AC || BD. (See Fig- 
ure 8.1.) 


(iii) Scalar multiplication of vectors: Let AP= t AB, where A and B are 
distinct points. Then P is on the line AB, 


AP 

AB 


1*1 


and 

(a) P = A if t = 0, P = B if t = 1; 

(b) P is between A and B if 0 < t < 1; 

(c) B is between A and P if 1 < i; 

(d) A is between P and B if t < 0. 


(See Figure 8.2.) 
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z 



Figure 8.2: Scalar multiplication of vectors. 



ai 


a 2 

The dot product X Y of vectors X = 

bi 

and Y = 

b 2 


. c i 


. C 2 


by 


X Y = oia 2 + bib-2 + cic 2 . 


is defined 


The length ||X|| of a vector X is defined by 


X\\ = (X -X) 1 / 2 


and the Cauchy-Schwarz inequality holds: 

\X-Y\<\\X\\-\\Y 


The triangle inequality for vector length now follows as a simple deduction: 


x + y\\ < \\x\\ + ||y||. 


Using the equation 


AB = II AB 


we deduce the corresponding familiar triangle inequality for distance: 
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The angle 6 between two non-zero vectors X and Y is then defined by 

X • Y 


cos V = 


0 < 0 < IT. 


1*11 -i mr 

This definition makes sense. For by the Cauchy-Schwarz inequality, 

X Y 


-1 < 


\x • y 


< l. 


Vectors X and Y are said to be perpendicular or orthogonal if X ■ Y = 0. 
Vectors of unit length are called unit vectors. The vectors 



' 1 ' 


o 


o 

II 

0 

, j = 

1 

II 

0 


1 

O 

i 


1 

0 

1 


_ 1 _ 


are unit vectors and every vector is a linear combination of i, j and k: 


a 

b 

c 


= ai + 6j + ck. 


Non-zero vectors X and Y are parallel or proportional if the angle be- 
tween X and Y equals 0 or ir; equivalently if X = tY for some real number 
t. Vectors X and Y are then said to have the same or opposite direction, 
according asf>0orf<0. 

We are then led to study straight lines. If A and B are distinct points, 
it is easy to show that AP + PB = AB holds if and only if 


AP= t AB, where 0 < t < 1. 


A line is defined as a set consisting of all points P satisfying 


P = Po + tX, tG R or equivalently PqP= tX, 

for some fixed point Po and fixed non-zero vector X called a direction vector 
for the line. 

Equivalently, in terms of coordinates, 

x = xo + ta, y = yo + tb, z = zo + tc, 
where Pq = (.To, yo, zq) and not all of a, b , c are zero. 
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There is then one and only one line passing passing through two distinct 
points A and B. It consists of the points P satisfying 

AP= t ATI, 


where t is a real number. 

The cross-product XxY provides us with a vector which is perpendicular 
to both X and Y . It is defined in terms of the components of X and Y: 

Let X = aii + 6ij + cik and Y = ci 2 i + & 2 j + C 2 k. Then 

XxY = ai + b] + ck, 


where 


a = 


bi ci 

b2 c 2 


«1 

Cl 


a\ 

bi 

«2 

C2 

, c = 

a>2 

b 2 


The cross-product enables us to derive elegant formulae for the distance 
from a point to a line, the area of a triangle and the distance between two 
skew lines. 


Finally we turn to the geometrical concept of a plane in three-dimensional 
space. 

A plane is a set of points P satisfying an equation of the form 


P = Po + sX + tY, s, i £ I, 


(8.1) 


where X and Y are non-zero, non-parallel vectors. 

In terms of coordinates, equation 8.1 takes the form 


x = xq + sa i + ta 2 
V = Vo + sbi + tb 2 
Z = Zo + SCI + tC2, 


where P 0 = (x 0 , yo, zq). 

There is then one and only one plane passing passing through three 
non-collinear points A, B, C. It consists of the points P satisfying 


~AP= s AB +t AC , 
where s and t are real numbers. 

The cross-product enables us to derive a concise equation for the plane 
through three non-collinear points A, B, C, namely 
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When expanded, this equation has the form 

ax + by + cz = d, 

where ai + 6j + ck is a non-zero vector which is perpendicular to P 1 P 2 for 
all points Pi, P 2 lying in the plane. Any vector with this property is said to 
be a normal to the plane. 

It is then easy to prove that two planes with non-parallel normal vectors 
must intersect in a line. 

We conclude the chapter by deriving a formula for the distance from a 
point to a plane. 

8.2 Three-dimensional space 

DEFINITION 8.2.1 Three-dimensional space is the set E 3 of ordered 
triples ( x , y, z ), where x, y, z are real numbers. The triple (x, y, z) is called 
a point P in E 3 and we write P = (x, y, z). The numbers x , y, z are called, 
respectively, the x, y, z coordinates of P. 

The coordinate axes are the sets of points: 

(0,0, 0)} (x-axis), {(0, y, 0)} (y-axis), {(0, 0, z)} (z-axis). 

The only point common to all three axes is the origin O = (0, 0, 0). 

The coordinate planes are the sets of points: 

{(A, V, 0)} (xy-plane), {(0, y, z)} (yz-plane), {(x, 0, z)} (xz-plane). 

The positive octant consists of the points (x, y, z), where x > 0, y > 
0, z > 0. 

We think of the points (x, y, z) with z > 0 as lying above the xy-plane, 
and those with z < 0 as lying beneath the xy-plane. A point P = (x, y, z) 
will be represented as in Figure 8.3. The point illustrated lies in the positive 
octant. 

DEFINITION 8.2.2 The distance P 1 P 2 between points Pi = (xi, yi, zi) 
and P 2 = (x 2 , y 2 , Z 2 ) is defined by the formula 

P 1 P 2 = V ( x 2 ~ xi) 2 + (y 2 - yi) 2 + (z 2 - zi) 2 . 

For example, if P = (x, y, z), 


OP = \/ x 2 + y 2 + z 2 . 
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DEFINITION 8.2.3 If A = (xi, y i, z\) and B = (. x 2 , y 2 , Z 2 ) we define 
the symbol AB to be the column vector 

x 2 - X\ 

2/2 - 2/1 • 

^2 - Z! _ 

We let P =OP and call P the position vector of P. 

The components of AB are the coordinates of B when the axes are 
translated to A as origin of coordinates. 

We think of AB as being represented by the directed line segment from 
A to B and think of it as an arrow whose tail is at A and whose head is at 
B. (See Figure 8.4.) 

Some mathematicians think of AB as representing the translation of 
space which takes A into B. 

The following simple properties of AB are easily verified and correspond 
to how we intuitively think of directed line segments: 

(i) AB= 0 A = B; 

(ii) RA= - AB; 

(iii) AB + BC=AC (the triangle law); 

(iv) lBC=AC - AB= C - B; 

(v) if X is a vector and A a point, there is exactly one point B such that 
AB= X, namely that defined by B = A + X. 

To derive properties of the distance function and the vector function 
P±P- 2 , we need to introduce the dot product of two vectors in M 3 . 



8.3 Dot product 



«i 


a 2 

DEFINITION 8.3.1 If X = 

bi 

and Y = 

b 2 


. c i 


. C 2 


dot product of X and Y, is defined by 


then X ■ Y, the 


X Y = aia-2 + bib 2 + cic 2 . 
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B B 




Figure 8.5: The negative of a vector. 




Figure 8.6: (a) Equality of vectors; (b) Addition and subtraction of vectors. 


The dot product has the following properties: 

(i) X ■ (Y + Z) = X ■ Y + X ■ Z; 

(ii) X Y = Y ■ X- 

(hi) (tX) ■ Y = t(X ■ y); 


(iv) X-X = a 2 + b 2 + c 2 if A 


a 


b 


c 


(v) X-Y = X t Y- 

(vi) X ■ X = 0 if and only if X = 0. 

The length of X is defined by 

\\X\\ = Va 2 + b 2 + c 2 = {X • X) 1 / 2 . 


We see that ||P|| = OP and more generally || P 1 P 2 || = P 1 P 2 , the 
distance between P\ and P- 2 - 
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z 



X 

Figure 8.7: Position vector as a linear combination of i, j and k. 

Vectors having unit length are called unit vectors. 

The vectors 



are unit vectors. Every vector is a linear combination of i, j and k: 

a 

b = ai + 6j + ck. 
c 

(See Figure 8.7.) 

It is easy to prove that 

||**|| = |*HI*II, 

if t is a real number. Hence if X is a non-zero vector, the vectors 



are unit vectors. 

A useful property of the length of a vector is 

||*±y || 2 = ||A|| 2 ±2A-y + ||v|| 2 . 


(8.2) 
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The following important property of the dot product is widely used in 
mathematics: 

THEOREM 8.3.1 (The Cauchy— Schwarz inequality) 

If X and Y are vectors in R 3 , then 

\X-Y\ < ||X|| • ||y||. (8.3) 


Moreover if X ^ 0 and F/0, then 

X -Y = \\X\\-\\Y\\ ^ Y = tX,t> 0, 

X ■ Y = -\\X\\ ■ \\Y\\ ^ Y = tX,t< 0. 

Proof. If X = 0, then inequality 8.3 is trivially true. So assume I/O. 
Now if t is any real number, by equation 8.2, 

0<||Cf-T|| 2 = \\tX\\ 2 -2{tX) -Y +\\Y\\ 2 
= t 2 \\X\\ 2 - 2(X ■ Y)t + \\Y\\ 2 
= at 2 — 2 bt + c, 

where a = ||X|| 2 > 0, b = X ■ Y, c = \\Y\\ 2 . 

Hence 


a(t 2 



a a 



> 0 
> 0 


Substituting t = b/a in the last inequality then gives 

ac — b 2 
— — >0, 
a z 

so 

|6| < \[ac. = \fa\fc 
and hence inequality 8.3 follows. 

To discuss equality in the Cauchy-Schwarz inequality, assume I/O 
and Y / 0. 

Then if X ■ Y = ||X|| • ||T||, we have for all t 

| \tX — Y\ \ 2 = t 2 \\X\\ 2 - 2tX ■ Y + \\Y\\ 2 

= t 2 ||x|| 2 -2t||x||-||y|| + ||y|| 2 
= \\tX-Y\\ 2 . 

Taking t = ||X||/||y|| then gives || tX — Y\\ 2 = 0 and hence tX — Y = 0. 
Hence Y = tX, where t > 0. The case X Y = — ||X|| -||y is proved similarly. 
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COROLLARY 8.3.1 (The triangle inequality for vectors) 

If X and Y are vectors, then 

\\X + Y\\ < \\X\\ + ||Y||. (8.4) 

Moreover if X ^ 0 and Y ^ 0, then equality occurs in inequality 8.4 if and 
only if Y = tX, where t > 0. 

Proof. 

||A + Y|| 2 = \\X\\ 2 + 2X ■ Y + \\Y\\ 2 

< ||A|| 2 + 2||A|| • ||Y|| + ||Y|| 2 
= (||A|| + ||Y||) 2 

and inequality 8.4 follows. 

If 1 1 A + Y|| = ||X|| + ||Y||, then the above proof shows that 

X Y = \\X\\ • ||Y||. 

Hence if X 0 and F/0, the first case of equality in the Cauchy-Schwarz 
inequality shows that Y = tX with t > 0. 

The triangle inequality for vectors gives rise to a corresponding inequality 
for the distance function: 

THEOREM 8.3.2 (The triangle inequality for distance) 

If A, B, C are points, then 

AC < AB + BC. (8.5) 

Moreover if B ^ A and B C, then equality occurs in inequality 8.5 if and 
only if AB= r AC, where 0 < r < 1. 

Proof. 

AC = 1 1 AC 1 1 = \\AB + BC\\ 

< || AB || + || BC || 

= AB + BC. 

Moreover if equality occurs in inequality 8.5 and B ^ A and B ^ C, then 
X =ABy £ 0 and Y =BC ^ 0 and the equation AC = AB + BC becomes 
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||A + T|| = ll-X’H + ||y||. Hence the case of equality in the vector triangle 
inequality gives 

Y =BC= tX = t AB , where t > 0. 


Then 

BC = AC -AB=tAB 
AC = (l + t)AB 
AB = r AC, 

where r = l/(t + 1) satisfies 0 < r < 1. 


8.4 Lines 

DEFINITION 8.4.1 A line in E 3 is the set jC(Pq, X ) consisting of all 
points P satisfying 

P = Po + tX, te R or equivalently PqP= tX, (8.6) 

for some fixed point Po and fixed non-zero vector X. (See Figure 8.8.) 
Equivalently, in terms of coordinates, equation 8.6 becomes 

x = xo + ta, y = yo + tb, z = zq + tc, 

where not all of a, b, c are zero. 

The following familiar property of straight lines is easily verified. 

THEOREM 8.4.1 If A and B are distinct points, there is one and only 
one line containing A and B, namely C(A, AB) or more explicitly the line 
defined by AP= t AB , or equivalently, in terms of position vectors: 

P = (1 — f)A + fB or P = A + t AB . (8.7) 

Equations 8.7 may be expressed in terms of coordinates: if A = (aq, yi, z{) 
and B = (x 2 , yi, z 2 ), then 

x = (1 - t)x\ + tx 2 , y = (1 — t)y\ + ty 2 , z = (1 - t)z\ + tz 2 . 
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There is an important geometric significance in the number t of the above 
equation of the line through A and B. The proof is left as an exercise: 


THEOREM 8.4.2 (Joachimsthal’s ratio formulae) 

If t is the parameter occurring in theorem 8.4.1, then 


A P 

(.) W = ^; (ii) 


1 -t 


= — if P^B. 
PB T 


Also 


(iii) P is between A and B if 0 < t < 1; 

(iv) B is between A and P if 1 < t; 

(v) A is between P and B if t < 0. 


(See Figure 8.9.) 

For example, f = ^ gives the mid-point P of the segment AB: 

P=^(A + B). 

EXAMPLE 8.4.1 £ is the line AB, where A = (—4, 3, 1), B = (1, 1, 0); 
M. is the line CD, where C = (2, 0, 2), D = (—1, 3, —2); M is the line EF, 
where E = (1, 4, 7), F = (—4, —3, —13). Find which pairs of lines intersect 
and also the points of intersection. 

Solution. In fact only C and A f intersect, in the point (— |, |, |). For 
example, to determine if C and J\T meet, we start with vector equations for 
C and Af: 

P = A + f AB, Q = E + sEJF, 
equate P and Q and solve for s and t: 

(-4i + 3j + k) + t( 5i - 2j - k) = (i + 4j + 7k) + s(-5i — 7j — 20k), 
which on simplifying, gives 

5t + 5s = 5 
-2 1 + 7s = 1 
—t + 20s = 6 

This system has the unique solution t = |, s = ^ and this determines a 
corresponding point P where the lines meet, namely P = (— |, |, ^). 

The same method yields inconsistent systems when applied to the other 
pairs of lines. 
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EXAMPLE 8.4.2 If A = (5, 0, 7) and B = (2, —3, 6), find the points P 
on the line AB which satisfy AP/PB = 3. 

Solution. Use the formulae 

P = A + t AB and 



Then 


= 3 or — 3, 

1 -t 


so t = | or t = |. The corresponding points are (^, and (5, ). 


DEFINITION 8.4.2 Let X and Y be non-zero vectors. Then A is parallel 
or proportional to Y if X = tY for some t £ R. We write X|| Y if X is parallel 
to Y. If X = tY, we say that X and Y have the same or opposite direction , 
according asf> 0 orf< 0 . 


DEFINITION 8.4.3 if A and B are distinct points on a line C, the non- 
zero vector AB is called a direction vector for C. 


It is easy to prove that any two direction vectors for a line are parallel. 

DEFINITION 8.4.4 Let C and M. be lines having direction vectors X 
and Y, respectively. Then C is parallel to AA if X is parallel to Y . Clearly 
any line is parallel to itself. 

It is easy to prove that the line through a given point A and parallel to a 
given line CD has an equation P = A + t CD. 

THEOREM 8.4.3 Let X = aii + &ij + cik and Y = a 2 i + Hj + C2k be 
non-zero vectors. Then X is parallel to Y if and only if 


a\ 

bi 


bi 

Cl 


ai 

Cl 

<32 

b 2 


b 2 

C2 


a 2 

c 2 


Proof. The case of equality in the Cauchy-Schwarz inequality (theorem 8.3.1) 
shows that X and Y are parallel if and only if 


X • Y 


XW-WY 


Squaring gives the equivalent equality 

(01O2 + ^i&2 T cic 2 ) 2 = (af + b\ + c 2 )(d2 T &2 d~ c\) 
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which simplifies to 


(ai&2 — ®2^l)^ + ip\C2 ~ t>2Ci) 2 + (aiC2 — CL 2 C 1) 2 — 0, 


which is equivalent to 

ai& 2 - = 0, b\C2 - b 2 c\ = 0, aic 2 - a 2 ci = 0, 

which is equation 8.8. 

Equality of geometrical vectors has a fundamental geometrical interpre- 
tation: 

THEOREM 8.4.4 Suppose A. B, C, D are distinct points such that no 

three are collinear. Then AB=CD if and only if AB || CD and AC || BD 
(See Figure 8.1.) 

Proof. If AB=CD then 


B - A = D - C, 

C-A = D-B 

and so AC=BD. Hence AB || CD and AC || BD. 

Conversely, suppose that AB || CD and AC |j BD. Then 

AB= s CD and AC= t BD, 


or 

B — A = s(D — C) and C-A = fD-B. 

We have to prove s = 1 or equivalently, t = 1. 

Now subtracting the second equation above from the first, gives 

B — C = s(D — C) — f(D — B), 


so 

(1 -i)B = (1 -s)C + (s-t)D. 


If f 1, then 


B = 


1 — s 

1-t 


C + 


s — t 


1-t 


D 


and B would lie on the line CD. Hence t = 1. 
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8.5 The angle between two vectors 

DEFINITION 8.5.1 Let X and Y be non-zero vectors. Then the angle 
between X and Y is the unique value of 9 defined by 


cos 9 = 


X ■ Y 


0 < 9 < it. 


REMARK 8.5.1 By Cauchy’s inequality, we have 


-1 < 


X Y 


< 1 , 


.1*11 -imi 

so the above equation does define an angle 9. 

In terms of components, if X = [aq, b\, ci] 1 and Y = [a 2 , b 2 , 02 ]*, then 

ai «2 + b\b 2 + cic 2 


cos 9 = 


(8.9) 


\Ja\ + bj + cj \Ja\ + bl + c\ 

The next result is the well-known cosine rule for a triangle. 

THEOREM 8.5.1 (Cosine rule) If A, B , C are points with A ^ B and 
i/C, then the angle 9 between vectors AB and AC satifies 

AB 2 + AC 2 - BC 2 


cos u = 


2 AB ■ AC 


( 8 . 10 ) 


or equivalently 


BC 2 = AB 2 + AC 2 - 2 AB ■ AC cos 9. 


(See Figure 8.10.) 

Proof. Let A = (aq, y±, zi), B = (x 2 , 2/2, z 2 ), C = (x 3 , y 3 , ^3)- Then 

AB = aii + b\j + cik 
AC = a 2 i + 62 J + C 2 k 
BC = (o 2 - ai)i + ( 6 2 - 6i)j + (c 2 - ci)k, 

where 

a 1 — %i+i bi — y i+ i yi, Cj — z 1 , i — 1, 2. 
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Figure 8.10: The cosine rule for a triangle. 


Now by equation 8.9, 


cos 9 


aW2 + hb 2 + cic 2 

AB- AC 


Also 


AB 2 + AC 2 — BC 2 = (a 2 + b\ + c 2 ) + (a 2 + b 2 + c 2 2 ) 

- ((a 2 - ai) 2 + ( b 2 - bi) 2 + (c 2 - ci) 2 ) 
= 2aia 2 + 2bib 2 + cic 2 . 

Equation 8.10 now follows, since 


AB ■ AC= a\tt2 + b\b2 + cic 2 . 


EXAMPLE 8.5.1 Let A = (2, 1, 0), B = (3, 2, 0), C = (5, 0, 1). Find 
the angle 6 between vectors AB and AC . 


Solution. 


cos 9 


AB ■ AC 
AB- AC' 


Now 


AB= i + j and AC=‘5i— j + k. 
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z 



Figure 8.11: Pythagoras’ theorem for a right-angled triangle. 


Hence 

. 1 x 3 + 1 x (-1) + 0 x 1 2 v/2 

cos 9 = — . — . = = — i= — 1 = = — 1 = . 

Vi 2 + 1 2 + o 2 v 732 + (- 1 ) 2 + 12 v/2\/TT v'IT 

Hence 0 = cos -1 

DEFINITION 8.5.2 If X and Y are vectors satisfying X ■ Y = 0, we say 
X is orthogonal or perpendicular to Y. 

REMARK 8.5.2 If A, B , C are points forming a triangle and AB is or- 
thogonal to AC, then the angle 9 between AB and AC satisfies cos 9 = 0 
and hence 9 = | and the triangle is right-angled at A. 

Then we have Pythagoras ’ theorem: 

BC 2 = AB 2 + AC 2 . (8.11) 

We also note that BC > AB and BC > AC follow from equation 8.11. (See 
Figure 8.11.) 


EXAMPLE 8.5.2 Let A = (2, 9, 8), B = (6, 4, -2), C = (7, 15, 7). Show 

that AB and AC are perpendicular and find the point D such that ABDC 
forms a rectangle. 
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y 

x 

Figure 8.12: Distance from a point to a line. 

Solution. 

AH ■ AC= (4i - 5j - 10k) • (5i + 6j - k) = 20 - 30 + 10 = 0. 

Hence AB and AC are perpendicular. Also, the required fourth point D 
clearly has to satisfy the equation 

BD=AC, or equivalently D — B =AC . 

Hence 

D = B+ AC= (6i + 4j — 2k) + (5i + 6j — k) = Hi + lOj — 3k, 
so D = (11, 10, -3). 

THEOREM 8.5.2 (Distance from a point to a line) If C is a point 
and C is the line through A and B, then there is exactly one point P on C 
such that CP is perpendicular to AB, namely 

. A ( 7 . A f> 

P = A + tAB, t= AB2 . (8.12) 

Moreover if Q is any point on C, then CQ > CP and hence P is the point 
on C closest to C. 
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The shortest distance CP is given by 

J AC 2 AB 2 - (AC ■ AB) 2 

CP = V -• (8.13) 

(See Figure 8.12.) 

Proof. Let P = A + t AB and assume that CP is perpendicular to AB. 
Then 


CP ■ AB = 0 
(P — C )-AB = 0 
(A + i AH -C)- AB = 0 
(CA+tAB)-AB = 0 
CA ■ AB +t(AB ■ AB) = 0 
-AC-AB +t(AB ■ AB) = 0, 

so equation 8.12 follows. 

The inequality CQ > CP, where Q is any point on C, is a consequence 
of Pythagoras’ theorem. 

Finally, as CP and PA are perpendicular, Pythagoras’ theorem gives 

CP' 2 = AC 2 -PA 2 

= AC 2 -\\tAB\\ 2 
= AC 2 — t 2 AB 2 

- 

AC 2 AB 2 - (AC ■ AB) 2 
AB 2 ’ 

as required. 

EXAMPLE 8.5.3 The closest point on the line through A = (1, 2, 1) and 
B = (2, —1, 3) to the origin is P = (j|, j|, ff) and the corresponding 
shortest distance equals \/42- 

Another application of theorem 8.5.2 is to the projection of a line segment 
on another line: 
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C i 



x 

Figure 8.13: Projecting the segment C 1 C 2 onto the line AB. 




172 


CHAPTER 8. THREE-DIMENSIONAL GEOMETRY 


so 


PiP 2 = 


where n is the unit vector 

Inequality 8.14 then follows from the Cauchy-Schwarz inequality 8.3. 

DEFINITION 8.5.3 Two non-intersecting lines are called skew if they 
have non-parallel direction vectors. 

Theorem 8.5.3 has an application to the problem of showing that two skew 
lines have a shortest distance between them. (The reader is referred to 
problem 16 at the end of the chapter.) 

Before we turn to the study of planes, it is convenient to introduce the 
cross-product of two vectors. 


P\P-2 || = |*2 — h\AB 
AC 2 AB AC! -AB 


AB 2 


C X C2 ■ AB 


AB 2 


AB 


AB 2 
C^C 2 -n 


AB 




8.6 The cross product of two vectors 

DEFINITION 8.6.1 Let X = aqi + 6ij + cik and Y = a 2 i + b 2 j + C2k. 
Then X x Y, the cross-product of X and Y, is defined by 

X x Y = ai + b] + ck, 


where 


a = 


h Cl 

b 2 c 2 



a i ci 

«2 C2 


ai 6i 

a 2 b 2 


The vector cross-product has the following properties which follow from 
properties of 2 x 2 and 3x3 determinants: 

(i) i x j = k, j X k = i, k X i = j; 
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(ii) X x X = 0-, 

(iii) Y x X = -X x Y; 

(iv) X x (Y + Z) = X x Y + X x Z; 

(v) (tX) xY = t(X x y); 

(vi) (Scalar triple product formula) if Z = a 3 i + b 3 j + 03k, then 

a\ b\ ci 

X-(YxZ)= a 2 b 2 c 2 =(XxY)-Z- 

a 3 b 3 C3 

(vii) X ■ (X xY) = 0 = Y ■ (X xY)-, 

(viii) \\X x Y\\ = v / ||^:|| 2 ||^|| 2 - ( X-Y ) 2 ; 

(ix) if X and Y are non-zero vectors and 9 is the angle between X and Y , 
then 

\\X x Y\\ = ||X|| • ||y|| sin$. 

(See Figure 8.14.) 

From theorem 8.4.3 and the definition of cross-product, it follows that 
non-zero vectors X and Y are parallel if and only if X xf = 0; hence by 
(vii), the cross-product of two non-parallel, non-zero vectors X and Y, is 
a non-zero vector perpendicular to both X and Y . 

LEMMA 8.6.1 Let X and Y be non-zero, non-parallel vectors. 

(i) Z is a linear combination of X and Y, if and only if Z is perpendicular 
to X xY] 

(ii) Z is perpendicular to X and Y . if and only if Z is parallel to X x Y. 
Proof. Let X and Y be non-zero, non-parallel vectors. Then 

1x7/0. 

Then if X x Y = ai + fcj + ck, we have 

a b c 

det [X x Y\X\YY = ai 61 ci = (X x Y) ■ (X x Y) > 0. 
a 2 b 2 c 2 
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z 



Figure 8.14: The vector cross-product. 


Hence the matrix [X x y|X|y] is non-singular. Consequently the linear 
system 

r(X xY) + sX + tY = Z (8.15) 

has a unique solution r, s, t. 

(i) Suppose Z = sX + tY. Then 

Z ■ (X x Y) = sX ■ (X x Y) + tY ■ (X x Y) = sO + tO = 0. 

Conversely, suppose that 


Z-(XxY)= 0. (8.16) 

Now from equation 8.15, r, s, t exist satisfying 

Z = r{X xY) + sX + tY. 

Then equation 8.16 gives 

0 = (r(X xh) + sl + tY) ■ (X x Y) 

= r\\X xY\\ 2 + sX ■ (X xY) + tY ■ (Y x X) 

= r\\XxY\\ 2 . 


Hence r = 0 and Z = sX + tY, as required. 

(ii) Suppose Z = X(X x Y). Then clearly Z is perpendicular to X and Y. 
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Conversely suppose that Z is perpendicular to X and Y. 
Now from equation 8.15, r, s, t exist satisfying 

Z = r{X xY) + sX + tY. 


Then 


sX -X + tX -Y = X • Z = 0 
sY-X + tY-Y = Y ■ Z = 0, 

from which it follows that 

(sX + tY) • (sX + tY) = 0. 

Hence sX + tY = 0 and so s = 0, t = 0. Consequently Z = r(X x Y). as 
required. 

The cross-product gives a compact formula for the distance from a point 
to a line, as well as the area of a triangle. 

THEOREM 8.6.1 (Area of a triangle) 

If A, B, C are distinct non-collinear points, then 

(i) the distance d from C to the line AB is given by 


d 


AB x AC 
~AB 


(8.17) 


(ii) the area of the triangle ABC equals 

II AB x AC II 1 1 A x B + B x C + C x A| 


2 2 

Proof. The area A of triangle ABC is given by 

. AB ■ CP 

A = , 


(8.18) 


where P is the foot of the perpendicular from C to the line AB. Now by 
formula 8.13, we have 


CP = 


\J AC 2 ■ AB 2 - (AC ■ AB) 2 


AB 


AB x AC 


AB 
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which, by property (viii) of the cross-product, gives formula 8.17. The 
second formula of equation 8.18 follows from the equations 

AH x AC = (B — A) x (C — A) 

= {(B — A) x C} — {(C - A) x A} 

= {(B x C — A x C)} - {(B x A - A x A)} 

= BxC-AxC-BxA 
= BxC + CxA + AxB, 

as required. 

8.7 Planes 

DEFINITION 8.7.1 A plane is a set of points P satisfying an equation 
of the form 

P = Po + sX + tY, s, t G M, (8.19) 

where X and Y are non-zero, non-parallel vectors. 

For example, the .xy-plane consists of the points P = ( x , y, 0) and corre- 
sponds to the plane equation 

P = xi + yj = O + xi + y j . 

In terms of coordinates, equation 8.19 takes the form 

x = xq + sai + ta 2 
V = yo + sbi + tb 2 
Z = Z 0 + SC\ + tc 2 , 

where Po = (-An Mu -^o) and (ai, bi, c\) and ( 02 , 62 , c\2 are non-zero and 
non-proportional . 

THEOREM 8.7.1 Let A , B, C be three non-collinear points. Then there 
is one and only one plane through these points, namely the plane given by 
the equation 

P = A + s AB +t AC, (8.20) 

or equivalently 


(See Figure 8.15.) 


AP= s AB +t AC . 


(8.21) 
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Figure 8.15: Vector equation for the plane ABC. 

Proof. First note that equation 8.20 is indeed the equation of a plane 

through A, B and C, as AB and AC are non-zero and non-parallel and 
( s , t) = (0, 0), (1, 0) and (0, 1) give P = A, B and C, respectively. Call 
this plane V. 

Conversely, suppose P = Po + sX + tY is the equation of a plane Q 
passing through A, B, C. Then A = Po + soA + to Y, so the equation for 
Q may be written 

P = A + (s - s 0 ) X + (t - t 0 )Y = A + s' X + t'Y- 

so in effect we can take Po = A in the equation of Q. Then the fact that B 
and C lie on Q gives equations 

B = A + si A + BY, C = A + s 2 V + t 2 Y, 

or 

~AB= si A + ti Y, AC= s 2 X + t 2 Y. (8.22) 

Then equations 8.22 and equation 8.20 show that 

peg. 

Conversely, it is straightforward to show that because AB and AC are not 
parallel, we have 

/ 0 . 


si h 

s 2 t 2 
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Figure 8.16: Normal equation of the plane ABC. 


Hence equations 8.22 can be solved for X and Y as linear combinations of 
AB and AC, allowing us to deduce that 

QCV. 


Hence 


Q = V. 


THEOREM 8.7.2 (Normal equation for a plane) Let 

A = (aq, yi, z 3 ), B = (x 2 , 2 / 2 , z 2 ), C = (x 3 , y 3 , z 3 ) 
be three non-collinear points. Then the plane through A, B, C is given by 

AP ■( AB x AC) = 0, (8.23) 


or equivalently, 

x 

x 2 

X3 


X\ 

y 

- y i 

z 

~ Z 1 

Xi 

y2 

- 2/1 

Z2 

- Zl 

Xi 

y3 

- 2/1 

Z3 

- Zl 


= 0, (8.24) 


where P = (x, y, z ). (See Figure 8.16.) 
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Figure 8.17: The plane ax + by + cz = d. 


REMARK 8.7.1 Equation 8.24 can be written in more symmetrical form 
as 



X 

y 

z 

1 

X\ 

yi 

Zi 

1 

X 2 

2/2 

Z2 

1 

X 3 

2/3 

Z3 

1 


(8.25) 


Proof. Let V be the plane through A, B, C. Then by equation 8.21, we 
have P G V if and only if AP is a linear combination of AB and AC and so 
by lemma 8.6.1 (i) , using the fact that AB x AC ^ 0 here, if and only if AP 

is perpendicular to AB x AC. This gives equation 8.23. 

Equation 8.24 is the scalar triple product version of equation 8.23, taking 
into account the equations 


AP = (x - xi)i+ (y - yi)j + (z - zi)k, 
AB = (x 2 - xi)i+ (y 2 - yi)j + (z 2 - zi)k, 
AC = (x 3 - xi)i+ (y 3 - yi)j + (z 3 - zi)k. 


REMARK 8.7.2 Equation 8.24 gives rise to a linear equation in x, y and 
z: 


ax + by + cz = d, 
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where ai + b] + ck 7^ 0 . For 

x-xi y-yi z-zi 

X2 - Xl J/2 - Vi Z 2 - Zl 
X3 - Xl 2/3 - 2/1 2:3 - Zl 


X 

y 

z 


Xl 

2/1 

Zl 

x 2 - Xl 

V 2 - yi 

Z2 - Zl 

— 

H 

to 

1 

H 

2/2 - 2/1 

Z2 - Zl 

H 

1 

CO 

H 

2/3 - yi 

CO 

1 


CO 

1 

2/3 - 2/1 

hi 

CO 

1 


and expanding the first determinant on the right-hand side of equation 8.26 
along row 1 gives an expression 


ax + by + cz 


where 


a = 

2/2 - 2/1 z 2 Zl 

,b=- 

X 2 - Xl z 2 - Zl 

, C = 

X2 - Xl 2/2 - 2/1 


2/3 - 2/1 z 3 - z 1 


X3 - Xl Z3- Zl 


®3 - Xl 2/3 - 2/1 


But a, b, c are the components of AB x AC, which in turn is non-zero, as 
A, B, C are non-collinear here. 

Conversely if ai + bj + ck / 0, the equation 

ax + by + cz = d 

does indeed represent a plane. For if say a/0, the equation can be solved 
for x in terms of y and z: 


X 

y 

— 

- _d - 

a 

0 

+ y 

- _b - 
a 

1 

+ z 

c 

a 

0 

z 


0 _ 


0 _ 


1 . 


which gives the plane 

P = P 0 + yX + zY, 

where Po = (— 0, 0) and X = — -i + j and Y = — -i + k are evidently 
non-parallel vectors. 


REMARK 8.7.3 The plane equation ax + by + cz = d is called the normal 
form, as it is easy to prove that if P\ and P 2 are two points in the plane, 

then ai + fej + ck is perpendicular to P\P 2 . Any non-zero vector with this 
property is called a normal to the plane. (See Figure 8.17.) 
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By lemma 8.6.1 (ii) , it follows that every vector X normal to a plane 
through three non-collinear points A, B, C is parallel to AB x AC, since 
X is perpendicular to AB and AC. 

EXAMPLE 8.7.1 Show that the planes 

x + y — 2z = 1 and x + 3y — z = 4 

intersect in a line and find the distance from the point C = (1, 0, 1) to this 
line. 

Solution. Solving the two equations simultaneously gives 


1 5 


3 1 


x = 1 — z, y = z, 

2 2 ’ a 2 2 ’ 


(8.27) 


where z is arbitrary. Hence 


1. 3. ,5. 1. 

x'i + yj + zk = --i - -j + z{- 1 - -j + k), 

which is the equation of a line C through A = 0) and having 

direction vector |i — + k. 

We can now proceed in one of three ways to find the closest point on C 
to A. 

One way is to use equation 8.17 with B defined by 

AB= -i- -j + k. 

2 2 

Another method minimizes the distance CP, where P ranges over C. 

A third way is to find an equation for the plane through C, having 
|i — |j + kasa normal. Such a plane has equation 

5x — y + 2z = d, 

where d is found by substituting the coordinates of C in the last equation. 

d = 5xl — 0 + 2xl = 7. 


We now find the point P where the plane intersects the line C. Then CP 
will be perpendicular to C and CP will be the required shortest distance 
from C to C. We find using equations 8.27 that 

,15, 3 1 , 

5(-2 + 2 3 ) - ( 2 - 2 Z,+23 = 7 > 
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x 


Figure 8.18: Line of intersection of two planes. 


80 * = I5- Hence P=(f, ff, H). 

It is clear that through a given line and a point not on that line, there 
passes exactly one plane. If the line is given as the intersection of two planes, 
each in normal form, there is a simple way of finding an equation for this 
plane. More explicitly we have the following result: 

THEOREM 8.7.3 Suppose the planes 

a\x + biy + c\z = d\ (8.28) 

a 2 x + b 2 y + c 2 z = d 2 (8.29) 

have non-parallel normals. Then the planes intersect in a line C. 

Moreover the equation 

X(aix + b±y + c\z - d\) + y(a 2 x + b- 2 y + c 2 z - d 2 ) = 0, (8.30) 

where A and y are not both zero, gives all planes through C. 


(See Figure 8.18.) 

Proof. Assume that the normals aii + &ij + cik and a 2 i + b 2 j + 02k are 
non-parallel. Then by theorem 8.4.3, not all of 



A2 


h Cl 
b 2 c 2 



ai h 
a 2 b 2 


«1 Cl 

a 2 C2 


(8.31) 
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are zero. If say Ai 7^ 0, we can solve equations 8.28 and 8.29 for x and y in 
terms of z, as we did in the previous example, to show that the intersection 
forms a line C. 

We next have to check that if A and y are not both zero, then equa- 
tion 8.30 represents a plane. (Whatever set of points equation 8.30 repre- 
sents, this set certainly contains C.) 

(A01 + ya 2 )x + (Xbi + yb 2 )y + (Aci + yc 2 )z - (Adi + yd 2 ) = 0. 

Then we clearly cannot have all the coefficients 

Aai + ^a 2 , Xbi + yb 2 , Xc\ + yc 2 

zero, as otherwise the vectors aii + &ij + cik and a 2 i + b 2 j + c 2 k would be 
parallel. 

Finally, if V is a plane containing £, let Po = (xo, 2/Oj zo) be a point not 
on C. Then if we define A and y by 

A = ~(a 2 x 0 + b 2 y 0 + c 2 z 0 -d 2 ), y = aix 0 + &12/0 + ciz 0 - d 1; 

then at least one of A and y is non-zero. Then the coordinates of Po satisfy 
equation 8.30, which therefore represents a plane passing through C and Pq 
and hence identical with V. 

EXAMPLE 8.7.2 Find an equation for the plane through Po = (1> 0, 1) 
and passing through the line of intersection of the planes 

x + y — 2z = 1 and x + 3y — z = 4. 

Solution. The required plane has the form 

A(x + y — 2z — 1) + y(x + 2>y — z — 4) = 0, 

where not both of A and y are zero. Substituting the coordinates of Po into 
this equation gives 

— 2A + y(— 4) = 0, A = —2 y. 

So the required equation is 

-2 y(x + y - 2z — 1) + y(x + 3y - z - 4) = 0, 


or 

—x + y + 3z — 2 = 0. 

Our final result is a formula for the distance from a point to a plane. 
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Figure 8.19: Distance from a point Po to the plane ax + by + cz = d. 


THEOREM 8.7.4 (Distance from a point to a plane) 

Let Po = 1 / 0 ; zo) and V be the plane 

ax + by + cz = d. (8.32) 

Then there is a unique point P on V such that PqP is normal to V. Morever 

PnP = \ ax o + by 0 + czp - d\ 

V a 2 + b 2 + c 2 

(See Figure 8.19.) 

Proof. The line through Po normal to V is given by 

P = P 0 + t(ai + bj + ck), 

or in terms of coordinates 

x = x 0 + at , y = y Q + bt, z = z 0 + ct. 

Substituting these formulae in equation 8.32 gives 

a(x 0 + at) + b(y 0 + bt) + c(z 0 + ct) = d 

t(a 2 + b 2 + c 2 ) = -(ax 0 + by 0 + cz 0 - d), 


so 


( ax o + byp + cz 0 - d \ 


t = — 


a 2 + b 2 + c 2 
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Then 

iV* =11^11 = ||t(ai + &j + ck)|| 

= |t| \/ a 2 + b 2 + c 2 

\axp + by 0 + czp - 
a 2 + b 2 + c 2 
\axp + by 0 + czp - 

\J a 2 + b 2 + c 2 

Other interesting geometrical facts about lines and planes are left to the 
problems at the end of this chapter. 

8.8 PROBLEMS 


— \/a 2 + b 2 + c 2 
d\ 


1. Find the point where the line through A = (3, —2, 7) and B = 
(13, 3, —8) meets the xz- plane. 

[Ans: (7, 0, 1).] 

2. Let A, B , C be non-collinear points. If E is the mid-point of the 
segment BC and F is the point on the segment EA satisfying = 2, 
prove that 

F = J(A + B + C). 

(F is called the centroid of triangle ABC.) 

3. Prove that the points (2, 1, 4), (1, —1, 2), (3, 3, 6) are collinear. 

4. If A = (2, 3, —1) and B = (3, 7, 4), find the points P on the line AB 
satisfying PA/ PB = 2/5. 

[Ans: (f , f , f) and (§, I, -f ).] 

5. Let A4 be the line through A = (1, 2, 3) parallel to the line joining 
B = (—2, 2, 0) and C = (4, —1, 7). Also M is the line joining E = 
(1, —1, 8) and F = (10, —1, 11). Prove that A4 and N intersect and 
find the point of intersection. 

[Ans: (7, -1, 10).] 



186 


CHAPTER 8. THREE-DIMENSIONAL GEOMETRY 


6 . Prove that the triangle formed by the points (—3, 5, 6 ), (—2, 7, 9) and 
(2, 1, 7) is a 30°, 60°, 90° triangle. 

7. Find the point on the line AB closest to the origin, where A = 
(—2, 1, 3) and B = (1, 2, 4). Also find this shortest distance. 

[Ans: (-if, ff, ff) and /f.] 

8 . A line Af is determined by the two planes 

x + y — 2z = 1, and x + 3y — z = 4. 

Find the point P on Af closest to the point C = (1, 0, 1) and find the 
distance PC. 

[Ans: (if, 1|, II) and *f>.] 

9. Find a linear equation describing the plane perpendicular to the line 
of intersection of the planes x + y — 2z = 4 and 3x — 2y + z = 1 and 
which passes through ( 6 , 0 , 2 ). 

[Ans: 3x + 7y + 5z = 28.] 

10. Find the length of the projection of the segment AB on the line C, 
where A = (1, 2, 3), B = (5, —2, 6 ) and C is the line CD, where 
C = (7, 1, 9) and D = (-1, 5, 8 ). 

[Ans: f ] 

11. Find a linear equation for the plane through A = (3, —1, 2), perpen- 
dicular to the line C joining B = (2, 1, 4) and C = (—3, —1, 7). Also 
find the point of intersection of C and the plane and hence determine 

the distance from A to C. [Ans: bx+2y— 3z = 7, (^, §§, 

12. If P is a point inside the triangle ABC, prove that 

P = ?’A + sB + tC, 

where r + s + t = 1 and r > 0 , s > 0 , t > 0 . 

13. If B is the point where the perpendicular from A = ( 6 , —1, 11) meets 
the plane 3x + ly + 5z = 10, find B and the distance AB. 


[A„ n , z? ( 123 —286 255 \ ^ a t? 59 

[Ans. B - ( w , -go-, - 50 J and AB - ^=. 
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14. Prove that the triangle with vertices (—3, 0, 2), (6, 1, 4), (—5, 1, 0) 
has area 2^333. 

15. Find an equation for the plane through (2, 1, 4), (1, —1, 2), (4, —1, 1). 
[Ans: 2x — 7y + 6z = 21.] 


16. Lines C and At are non-parallel in 3-dinrensional space and are given 
by equations 


P = A + sX, Q = B + tY. 


(i) Prove that there is precisely one pair of points P and Q such that 
PQ is perpendicular to X and Y . 


(ii) Explain why PQ is the shortest distance between lines C and Xi . 
Also prove that 


PQ 


(X x Y)- ~AB 

\\X x y\\ 


17. If C is the line through A = (1, 2, 1) and C = (3, —1, 2), while Xi 
is the line through B = (1, 0, 2) and D = (2, 1, 3), prove that the 
shortest distance between C and Xi equals 4=. 


18. Prove that the volume of the tetrahedron formed by four non-coplanar 
points Ai = (xi, yi , Zi), 1 < i < 4, is equal to 

- | (A\A 2 x AiA 3 )- AiA 4 |, 

6 

which in turn equals the absolute value of the determinant 


1 

6 


1 

Xl 

Vi 

Z\ 

1 

X2 

2/2 

Z 2 

1 

X3 

2/3 

Z 3 

1 

X 4 

2/4 

Z 4 


19. The points A = (1, 1, 5), B = (2, 2, 1), C = (1, —2, 2) and D = 
(—2, 1, 2) are the vertices of a tetrahedron. Find the equation of the 
line through A perpendicular to the face BCD and the distance of A 
from this face. Also find the shortest distance between the skew lines 
AD and BC. 

[Ans: P = (1 + f)(i + j + 5k); 2\/3; 3.] 
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FURTHER READING 

Matrix theory has many applications to science, mathematics, economics 
and engineering. Some of these applications can be found in the books 
[2, 3, 4, 5, 11, 13, 16, 20, 26, 28]. 

For the numerical side of matrix theory, [6] is recommended. Its bibliography 
is also useful as a source of further references. 

For applications to: 

1. Graph theory, see [7, 13]; 

2. Coding theory, see [8, 15]; 

3. Game theory, see [13] ; 

4. Statistics, see [9]; 

5. Economics, see [10]; 

6. Biological systems, see [12]; 

7. Markov non-negative matrices, see [11, 13, 14, 17]; 

8. The general equation of the second degree in three variables, see [18]; 

9. Affine and projective geometry, see [19, 21, 22]; 

10. Computer graphics, see [23, 24], 
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SECTION 1.6 


0 0 0 
2 4 0 

0 1 3 

1 2 4 


2. (i) 


(ii) 


(iii) 


R\ — * R\ + R 3 
R 3 —> —R 3 
R-2 R 3 


R\ <— > i?2 


R\ > i?2 


2 4 0 
0 0 0 

1 2 4 “ 
0 1 3 


R\ — > bR\ 


1 2 0 
0 0 0 


R\ — > — 24?2 


1 0 -2 
0 1 3 


' 1 

1 

1 ' 

R -2 - 

R 2 

- R\ 

' 1 

1 

0 ' 

1 

1 

0 

0 

0 

-1 

1 

0 

0 

r 3 - 

r 3 

— R\ 

0 

-1 

-1 


1 0 0 

0 1 1 

0 0-1 


R 2 — * Ri R3 
R 3 —> —R 3 


1 0 0 
0 1 0 
0 0 1 


(iv) 


3. (a) 


2 0 0 
0 0 0 
-4 0 0 


R 3 R 3 + 2Z?i 

i?l — ► \Ri 


1 0 0 
0 0 0 
0 0 0 


1115 
2 3—1 6 

1 -1 -1 -i 


i?2 - * ► R 2 ~ 2i?i 

R 3 —> R 3 — R\ 


1112 

0 1-3 4 

0 -2 -2 -10 


-Ri — * ► Ri — R2 
R 3 “ ► R 3 + 2i?2 

4 ?i — ► R\ — 4R3 
R/2 —■ ► i?2 + 3i?3 


10 4-2 

0 1-3 4 

0 0 - 8-2 

10 0-3 

0 1 0 f 

0 0 1 \ 


R 3 —> ~wRs 


1 0 
0 1 
0 0 


4 2 

-3 4 


1 


The augmented matrix has been converted to reduced row-echelon form 


i?2 ~~ ► R 2 ~ 3i?i 
i?3 — > R 3 + 5i?i 


2 10 

-2 -29 
0 1 

From the last matrix we see that the original system is inconsistent. 



1 

1 

-1 

2 

10 

(b) 

3 

-1 

7 

4 

1 


-5 

3 - 

-15 

-6 

9 




' 1 

1 

-1 

r 3 - 

-2 R 3 + 2 R> 

0 

-4 

10 




0 

0 

0 


z 

1 

— r 




1 

1 

-1 

2 

10 

0 

-4 

10 

-2 

-29 

0 

8 

-20 

4 

59 


1 



(c) 


R-2 

R-3 

i?4 


' 3 

-1 

7 

0 ' 


' 1 

-1 

1 

1 ' 

2 

-1 

4 

1 

2 

Tii R 3 

2 

-1 

4 

1 

2 

1 

-1 

1 

1 

3 

-1 

7 

0 

_ 6 

-4 

10 

3 


_ 6 

-4 

10 

3 


R '2 ~~ 27? i 

7?3 — 3i?i 
7?4 — 6i?i 


' 1 

-1 

1 

1 ' 



' 1 

0 

3 

-1 - 

0 

1 

2 

^3 

Tii - 

-> Tii + Ti2 

0 

1 

2 

2 

^3 

0 

2 

4 

2 

-3 

Ti 4 - 

Ti4 — T?3 

0 

0 

0 

2 

0 

_ 0 

2 

4 

-3 

R 3 - 

* Ti3 — 2Ti2 

_ 0 

0 

0 

0 _ 


The augmented matrix has been converted to reduced row-echelon form 
and we read off the complete solution x = — 3 — 3 z, y = — | — 2z, with z 
arbitrary. 



2 

-1 

3 

a 


2 

-1 

3 

a 1 

4. 

3 

1 

-5 

6 

R 2 —■ y R 2 — Ri 

1 

2 

—8 b — a 


-5 

-5 

21 

c 


-5 

-5 

21 

c J 


Til i?2 


1 

2 

—8 6 — a 

R 2 - 

R 2 — 2 Tii 

' 1 

2 

-8 

6 — a 

2 

-1 

3 a 

0 

-5 

19 

—26 + 3 a 

-5 

-5 

21 c 

Rs - 

R 3 T 5 Tii 

0 

5 

-19 

56 — 5a + c 


Rs 


R's + R 2 

5 


R 2 — >• 


1 2 


0 1 


-19 


b — a 

26— 3a 


0 0 


Til — * Ri — 2T?2 


1 0 ir 
0 1 " 19 

0 0 


0 36 — 2a + c 

(6+a) 

5 

26— 3a 
5 5 

0 36 — 2a + c 


From the last matrix we see that the original system is inconsistent if 
36 — 2a + c / 0. If 36 — 2a + c = 0, the system is consistent and the solution 
is 

(6 + a) 2 _ (26 — 3a) 19 


x = 

where z is arbitrary. 

1 1 1 
5. | t It. 

1 + t 2 3 


+ rz, y = 

5 


R 2 —■ y R 2 — tRi 

R 3 — Ti3 — (1 + t)R\ 


+ y z ’ 


1 1 1 

0 1 ~t 0 
0 1 ~t 2 ~t 


Ti,3 — > T?3 — R 2 


1 1 1 
0 1 — 6 0 

0 0 2 -t 


= B. 


Case 1. t / 2. No solution. 


2 




' 1 

0 

1 ' 


' 1 

0 

1 ' 

Case 2. t = 2. B = 

0 

-1 

0 

-> 

0 

1 

0 


0 

0 

0 


0 

0 

0 


We read off the unique solution x = 1, y = 0. 


6. Method 1 . 

' -3 1 1 

1 -3 1 

1 1 -3 

1 1 1 


' 1 

0 

0 

-1 ' 



' 1 

0 

0 

-1 ' 

0 

1 

0 

-1 

r 4 - 

R 4 — R 3 — R2 — Ri 

0 

1 

0 

-1 

0 

0 

1 

-1 

0 

0 

1 

-1 

_ 1 

1 

1 

-3 



_ 0 

0 

0 

0 _ 


Hence the given homogeneous system has complete solution 


1 

1 

1 

-3 


Rl — > i?i — R 4 
i?2 “ ^ R'2 ~ R 4 
R3 * R3 R 4 


-4 0 

0 -4 
0 0 

1 1 


0 4 

0 4 

-4 4 

1 -3 


Xl = X4, X2 = X4, X 3 = X4, 

with X 4 arbitrary. 

Method 2 . Write the system as 

xi + X2 + X 3 + X4 = 4xi 

x\ + X2 + X 3 + X4 = 4x 2 

x\ + x 2 + x 3 + x 4 = 4 x 3 

x\ + x 2 + x 3 + x 4 = 4 x 4 . 


Then it is immediate that any solution must satisfy xi = X 2 = x 3 = x 4 . 
Conversely, if x 4 , X 2 , x 3 , x 4 satisfy x\ = X 2 = x 3 = x 4 , we get a solution. 

7. 


' A-3 1 

1 A-3 

Ri <— > R 2 

1 1 

CO 

1 

<< 

CO 

1 

R 2 — >■ R 2 ~ 

1 

CO 

' 1 A-3 

0 -A 2 + 6A - 


= B. 


Case 1: —A 2 + 6A — 8 / 0. That is —(A — 2) (A — 4) / 0 or A / 2, 4. Here B is 

1 0 


row equivalent to 


R-2 


1 

— A 2 +6A— 8 


R 2 


0 1 

' 1 A- 3 
0 1 


— > Ri — (A — 3)R2 


1 0 
0 1 


Hence we get the trivial solution x = 0, y = 0. 
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Case 2: A = 2. Then B = 
arbitrary. 

Case 3: A = 4. Then B = 
arbitrary. 


1 -1 

0 0 

1 1 

0 0 


and the solution is x = y, with y 


and the solution is x = —y, with y 


'3 11 1 ' 

1 1 

r 1 11 ii 

5-11-1 

^3 

1 

CO | 

33 3 

5 - 11-1 


R-2 

R-2 

Ri 


R 2 — 5/?,] 
-3 


1 

0 


R -2 


1 i 4 i 

L 3 3 3 

0 1 I 1 


1 


Ri “ 3-^2 


1 0 I 0 
Dili 


Hence the solution of the associated homogeneous system is 


1 


1 


xi = — -X3, x 2 = --x 3 - x 4 , 


with .T 3 and X 4 arbitrary. 
9. 


A = 


1 — n 1 
1 1 — n 


1 

1 

1 — n 


R\ — ¥ R\ — Rn 
II 2 R 2 — R)i 


Rn— 1 


Rn — 1 Rn 


— n 0 

0 — n 

1 1 


' 1 

0 • • 

.. _i 


' 1 

0 •• 

1 

T— 1 

1 

0 

1 • • 

• -1 

R n * Rn Rn— 1 * * ‘ R\ 

0 

1 •• 

• -1 

1 

1 • • 

■ • 1—77. 


O • 

• O 

1 

' O 


n 

n 


1 — n 


The last matrix is in reduced row-echelon form. 

Consequently the homogeneous system with coefficient matrix A has the 
solution 

X\ = X n , X2 — x n , • . • , x n — 1 = 


4 



with x n arbitrary. 

Alternatively, writing the system in the form 

xi H V x n = nx i 

x\ H b X n = ns 2 


xi H b = nx„ 

shows that any solution must satisfy nx\ = 71 x 2 = ■ ■ ■ = nx n , so x\ = X 2 = 
• ■ • = x n . Conversely if x\ = x n , . . . , x n _i = x n , we see that x±, . . . , x n is a 
solution. 

a b 


10. Let A = 
Case 1: a / 0. 


c d 


and assume that ad — be ^ 0. 


a b 



[ 1-1 


r 1 * 1 

c d 

Ri - 

* 5*1 

a 

c d 

R2 — ► R2 — cR. 1 

a 

q ad— be 

a 



' 1 ^ ' 


' 1 o' 

*2 - J^ c R2 

a 

0 1 


0 ! 


Case 2: a = 0. Then be / 0 and hence c/ 0. 


A = 


0 b 
c d 


R\ <— > i?2 


c d 
0 & 


1 4 

C 

0 1 


1 0 
0 1 


So in both cases, A has reduced row-echelon form equal to 
11. We simplify the augmented matrix of the system using row operations: 


1 0 
0 1 


' 1 

2 

-3 

4 

i?2- 

i?2 

- 3R\ 

' 1 

2 -3 

4 

3 

-1 

5 

2 

0 

-7 14 

-10 

4 

1 

a 2 — lA a + 2 

i?3 - 

i? 3 

- 4Ri 

0 

-7 a 2 - 2 a - 14 J 


R -3 ~ 

* R3 - R2 

' 1 

2-3 4 


R‘2 


0 

1 -2 f 

-Ri — ■> Ri — 2R2 


S3 

1 

to 

S 3 

to 

0 

0 a 2 — 16 a — 4 



1 0 1 
0 1 -2 

0 0 a 2 — IQ a — 4 


Denote the last matrix by B. 
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Case 1: a 2 — 16 / 0. i.e. a / ±4. Then 


~ > a 2 -16^ 3 

Ri — ► -Rl — ^3 
i?2 — 1 ► R2 + 2i?3 


1 0 0 
0 1 0 
0 0 1 


8a+25 1 
7(a+4) 
10a+54 
7(a+4) 

a+4 


and we get the unique solution 


_ 8a + 25 
" 7(o + 4)’ 


10a + 54 1 

7(a + 4) ’ ~ “ a + 4' 


Case 2: a = —4. Then B = 


1 0 1 


0 1 -2 f 


1 1 


0 0 


0 -s 


Case 3: a = 4. Then B = 


0 1 
1 -2 


i2 

7 

0 0 0 


consistent, with complete solution x 
arbitrary. 


, so our system is inconsistent. 

We read off that the system is 
| — z, y = y + 2+ where z is 


12. We reduce the augmented array of the system to reduced row-echelon 
form: 



' 1 

0 

1 

0 

1 ' 



' 1 

0 

1 

0 

1 ' 




0 

1 

0 

1 

1 

R3 * 

R3 + R\ 

0 

1 

0 

1 

1 




1 

1 

1 

1 

0 

0 

1 

0 

1 

1 




_ 0 

0 

1 

1 

0 _ 



_ 0 

0 

1 

1 

0 _ 





' 1 

0 

1 

0 1 ' 




' 1 

0 

0 

1 

1 ' 



0 

1 

0 

1 1 

R^ — > R 1 + Ra 

0 

1 

0 

1 

1 

-»• ^3 + +2 

0 

0 

0 

0 0 

R3 

R4 


0 

0 

1 

1 

0 



0 

0 

1 

1 0 




0 

0 

0 

0 

0 


The last matrix is in reduced row-echelon form and we read off the solution 
of the corresponding homogeneous system: 


x\ = — X4 — x§ = X4 + X'5 
x 2 = — X4 - x 5 = x 4 + X'5 
X3 = — X4 = X4, 
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where xq and X5 are arbitrary elements of Z2. Hence there are four solutions: 


X'l X 2 x 3 X 4 X '5 

0 0 0 0 0 

110 0 1 
11110 
0 0 111 


13. (a) We reduce the augmented matrix to reduced row-echelon form: 


' 2 

1 

3 

4 ' 



■ 1 

3 

4 

2 ' 

4 

1 

4 

1 

R\ - 

-> 3i?i 

4 

1 

4 

1 

3 

1 

2 

0 



3 

1 

2 

0 


i ?2 — ’ > + i?l 
R3 R3 + 2 i?i 


' 1 

3 

4 

2 " 



" 1 

3 

4 

2 ' 

0 

4 

3 

3 

i? 2 - 

x 4i?2 

0 

1 

2 

2 

0 

2 

0 

4 



0 

2 

0 

4 


— ■> R\ + 2i?2 

i?3 — > i?3 + 3i?2 


10 3 1 
0 12 2 
0 0 10 


Ri 

R-2 


R\ + 2 i ?3 

+ 3i?3 


10 0 1 
0 10 2 
0 0 10 


Consequently the system has the unique solution x = 1, y = 2, z = 0. 

(b) Again we reduce the augmented matrix to reduced row-echelon form: 


' 2 

1 

3 

4 ' 


■ 1 

1 

0 

3 " 

4 

1 

4 

1 

R\ «-»• 7?3 

4 

1 

4 

1 

1 

1 

0 

3 _ 


2 

1 

3 

4 


R -2 R2 + R\ 

R 3 — > R 3 + 3i?i 


R\ — > R\ + 4i?2 

R3 — > R3 + R2 

We read off the complete solution 


' 1 

1 

0 

3 ' 



' 1 

1 

0 

3 ' 

0 

2 

4 

4 

r 2 - 

3i?2 

0 

1 

2 

2 

0 

4 

3 

3 



0 

4 

3 

3 


10 3 1 
0 12 2 
0 0 0 0 


x = 1 — 3z = 1 + 2z 
y = 2 — 2z = 2 + 3 z, 


where z is an arbitrary element of Z 5 . 
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14. Suppose that (ou, . . . , a n ) and (fi\ , . . . , (3 n ) are solutions of the system 
of linear equations 

n 

cyyxy = bi, 1 < i < m . 

1= i 

Then 

n n 

yy dijCtj = bi and yy a ij@j = &£ 
i=i 1=1 

for 1 < i < m. 

Let j = (1 — t)oti + tfii for 1 < i < m. Then ( i, . . . , n ) is a solution of 
the given system. For 

n n 

°V 3 = Ml 1 ~ *)“! + W 

1= 1 3 = 1 

n n 

— y ' t)a.j T y ' a,ijt(3j 

3 = 1 l=i 

= (1 - t)h + tbi 

= k. 


15. Suppose that (oq, . . . , a n ) is a solution of the system of linear equations 

n 

yy a*yxy = bi, 1 < i < m. (1) 

i=i 

Then the system can be rewritten as 
n n 

yy dijXj = yy « ,y ay , 1 < * < m, 

1=1 1=1 

or equivalently 

n 

yy ciij(xj — aj ) = 0, 1 < i < m. 

1=1 

So we have 

n 

yy aijyj = 0, 1 < i < m. 

1=1 

where xy — ay = yy. Hence xy = ay + yy, 1 < y < n, where (yi, . . . , y n ) is 
a solution of the associated homogeneous system. Conversely if (yi, . . . , y n ) 



is a solution of the associated homogeneous system and Xj = ay + Dj , 1 < 
j < n, then reversing the argument shows that (x \, .. , , x n ) is a solution of 
the system 1 . 

16. We simplify the augmented matrix using row operations, working to- 
wards row-echelon form: 


11-11 1 
a 1 11 6 

3 2 0 a 1 + a 

R-2 i?3 

i?2 — * > — i?2 


R'3 ~ 1 " R'i + ( a — 1)^2 
Case 1: a / 2. Then 4 — 2a 


R-2 — ► R? — uR\ ^ , 

^3 * R 3 3i?i q _ 

'll -1 1 

0 1 -3 3 -a 

0 1 — a 1 + a 1 — a 

11-1 1 
0 1 -3 3 - a 

0 0 4 — 2a (1 — a) (a - 

^4 0 and 


-1 1 1 
-a 1 + a 1 — a b — a 
1 3 a — 3a — 2 

1 

2 — a 
b — a 

1 

2 — a 

2) —a 2 + 2a + b — 2 


5 


1 1 
0 1 
0 0 


-1 1 

-3 3 — a 

1 CL—1 

1 2 


1 

2 — a 

— a 2 +2a+b — 2 
4— 2a 


Hence we can solve for x, y and z in terms of the arbitrary variable w. 


Case 2: a = 2. Then 


B = 


11-11 1 
01-31 0 

00 00 6—2 


Hence there is no solution if b ^ 2. However if 6 = 2, then 


' 1 

l 

-l 

1 

1 ' 


' 1 

0 

2 

0 

1 ' 

0 

l 

CO 

1 

1 

0 

-> 

0 

1 

1 

CO 

1 

0 

1 

O 

0 

0 

0 

0 


1 

o 

0 

0 

0 

1 

o 


and we get the solution x = 1 — 2 z, y = 3z — w, where w is arbitrary. 


17. (a) We first prove that 1 + 1 + 1 + 1=0. Observe that the elements 
1 + 0, 1 + 1, 1 + a, 1 + 6 
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are distinct elements of F by virtue of the cancellation law for addition. For 
this law states that 1 + x = 1 + y ^ x = y and hence x^y=>l + x^l + y. 

Hence the above four elements are just the elements 0, 1, a, b in some 
order. Consequently 

(1 + 0) + (1 + 1) + (1 + a) + (1 + b) = 0 + 1 + a + b 

(1 + 1 + 1 + 1) + (0+l + o + b) = 0 + (0 + l + a + 6), 

sol + l + l + l = 0 after cancellation. 

Now 1 + 1 + 1 + 1 = (1 + 1)(1 + 1), so we have x 2 = 0, where x = 1 + 1. 
Hence x = 0. Then a + a = a(l + 1) = a • 0 = 0. 

Next a + b = 1. For a + b must be one of 0, 1, a, b. Clearly we can’t 
have a + b = a or 6; also if a + b = 0, then a + b = a + a and hence b = a; 
hence a + b = 1 . Then 


a T 1 — a T (n T 6) — (u T a) F b — 0T^ — b. 


Similarly b + 1 = a. Consequently the addition table for F is 


+ 

0 

1 

a 

b 

0 

0 

1 

a 

b 

1 

1 

0 

b 

a 

a 

a 

b 

0 

1 

b 

b 

a 

1 

0 


We now find the multiplication table. First, ab must be one of 1, a, b; 
however we can’t have ab = a or b, so this leaves ab = 1. 

Next a 2 = b. For a 2 must be one of 1, a, 6; however a 2 = a => a = 0 or 
a = 1; also 

a 2 = 1=^ a 2 — 1 = 0 =>- (a — l)(a + 1) = 0 (a — l) 2 = 0 =>• a = 1; 

hence a 2 = b. Similarly b 2 = a. Consequently the multiplication table for F 
is 


X 

0 

1 

a 

b 

0 

0 

0 

0 

0 

1 

0 

1 

a 

b 

a 

0 

a 

b 

1 

b 

0 

b 

1 

a 

We 

use 

the 

additio] 




' 1 

a 


A 

= 

a 

b 




1 

1 


Ik - 

r 3 


i?2 T 0:R\ 

> i?3 + i?l 


a 

a 

0 
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i?2 R 3 


' 1 

a 

6 

a 

i?2- 

-> ai? 2 

' 1 

a 

6 

a 

0 

6 

a 

0 

0 

1 

6 

0 

0 

0 

a 

a 

R3 - 

->• 6/1*3 

0 

0 

1 

1 _ 


10 0 0 
0 10 6 
0 0 11 

The last matrix is in reduced row-echelon form. 


R\ clR 2 


1 0 a a 
0 16 0 
0 0 11 


R\ — > Ri + C1R3 

R2 —■ > R2 + bR,i 
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Section 2.4 


a b 

2. Suppose B = c d and that AB = I 2 . Then 
. e / _ 

—101 — a + e —b + f 

0 loj C( ~|_0 1 J ~ [ c + e d+f ' 

L e J J 

Hence 

— a + e = 1 — 6 + / = 0 

c + e = 0 ’ d+ f = 1 ’ 

e = a + l f = b 

c = — e = —(a +1) ’ d = 1 — / = 1 — 6 ’ 

a b 

B = —a — 1 1 — 6 
a + 1 6 

Next, 

(BA) 2 B = (BA)(BA)B = B(AB)(AB) = BI 2 I 2 = BI 2 = B 

4. Let p n denote the statement 

A" = (3Lpi) a + ®=pl 2 . 

Then p\ asserts that A = - 3 ~ 1 j A + ^p-I 2 , which is true. So let n > 1 and 
assume p n . Then from (1), 

A n+1 = = H lA+&=pI 2 =(^A 2 + ^H 

= - 3 1 2 ) + ^Ap-A = (3"-uy3-3") A+ (3"-i)(-3) /2 

(4-3™— 3 n )— 1 4 , (3— 3 n+1 ) T 

— 2 ' 2 J2 

= (3^1)a+^^)/ 2 . 

Hence p n +i is true and the induction proceeds. 

5. The equation x n+ \ = ax n + bx n -\ is seen to be equivalent to 

x n -\- 1 Cl b X n 

x n J L 1 0 J L x n- 1 
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or 


X,„ — AXn.-A 


where X„ = 


%n + 1 


and A = 


a b 
1 0 


Then 


Xn = A n X 0 


if n > 1. Hence by Question 3, 


^n+1 

= ((3"-1) j 4+ ( 3-3") j 

Xl 


l 2 2 J 

XO 


(3 n — 1) I" 4 -3 
2 [ 1 0 
(3 n - 1)2+ ^ 

3 n — 1 


+ 


3—3" 


0 


3—3" 


(3 n — 1)(— 3) 

3—3" 


n 

Xl 

Jj 

_ x 0 

Xl 


X 0 



Hence, equating the (2,1) elements gives 

(3 n - 1) (3 - 3 n ) 

X n = 7T Xi H Xq 


if n > 1 


7. Note: X\ + X 2 = a + d and A 1 A 2 = ad — be. 
Then 


(Ai + X2)k n - XiX2k n -i — (Ai + A2)(A” 1 + A” 2 A2 + --- + AiA2 2 + A 2 3 ) 

-AiAsCA) 1 - 2 + A”- 3 A 2 + • • • + AiA”- 3 + A"’ 2 ) 


= (A” + X?- 1 X 2 + • • • + AiA”- 1 ) 

+(A?" 1 A 2 + • • • + AiAir 1 + A 5 ) 

— (A" 1 X 2 + • • • + A1A2 3 ) 

= A" + A” 3 A 2 + • • • + A1A2 3 + A[> = k n + 1 

If Ai = A 2 , we see 

kn = A ”- 1 + A"- 2 A 2 + • • • + AiA ^ 2 + A "- 1 
= A ”” 1 + A" _ 2 Ai + • • • + AiA ” -2 + A ”” 1 
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If Ai / A 2 , we see that 


(Ai - A 2 )(A ”- 1 + A?- 2 A 2 + • • • + AiA ^ 2 + Ar 1 ) 
a; 1 + a^- 1 a 2 + • • • + AiA”- 1 
— (A" _ 1 A 2 H f- A1A2 -1 + A2) 

\n \n 
Ai - A 2 . 


A " — k n A — AiA 2 fc n _i/ 2 . * 

A 1 = A] also k\A — AiA 2 /co / 2 = k\A — AiA 2 0/ 2 

= A. 

Let n > 1 and assume equation * holds. Then 

A n+1 = A n -A = (k n A - A 1 A 2 fc n _i/ 2 )yl 

= k n A XiX2k n -iA. 

Now A 2 = (a + d)A — (ad — 6c)/ 2 = (Ai + A 2 )H — AiA 2 / 2 . Hence 

A n+1 = fc n (Ai + A 2 )H - AiA 2 / 2 - AiA 2 fc n _iH 

= {&n(Ai + A 2 ) — AiA 2 fc n _i}H — \1\2knh 

— kn+iA AiA 2 fc n / 2 , 

and the induction goes through. 

8. Here Ai, A 2 are the roots of the polynomial x 2 — 2x — 3 = (x — 3)(x + 1). 
So we can take Ai = 3, A 2 = — 1. Then 


Hence 

A n = 




kn — 


3 n — (— 1)” _ 3 n + (— l) n+1 
3-(-l) I 


3 " + (- 1 )" + M A _ ( _ 3 ) f 3"- 1 + (- l )" K2 


3 n + (~l) n+1 
4 


' 1 2 

|3»- + (-!)» 1 

' 1 0 ' 

2 1 

l 4 / 

0 1 


(Ai — A 2 )k n — 


\ n \n 

Hence k n = — — - 


Ai— A 2 
We have to prove 


n=l: 
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which is equivalent to the stated result. 
9. In terms of matrices, we have 


F n +i 


' 1 1 ' 

F n 

F n 


1 0 

F n ~ 1 


for n > 1. 


F n +i 


' 1 

1 ' 

n 

' F\ ' 


' 1 

1 ' 

n 

' 1 ' 

F n 


1 

0 


_ F o _ 


1 

0 


0 


Now Ai, A 2 are the roots of the polynomial x 2 — x — 1 here. 
Hence Ai = 1+ /^ and A 2 = 1 ~ 9 V ^ and 


k n = 




1+V5 

2 

n— 1 


1— \/5 
2 


W-(^ 

C5 


n— 1 


n— 1 


Hence 


H" — k n A — \i\-2k n -il2 

= + k n —\l2 


So 


F, 


n+l 


— ( k n A H~ k n —\l2) 


— kn 


+ ^n-1 


1 

0 

1 

0 


k n + k n — 1 

kn. 


Hence 


F — k — 

1 n — h n — 




V5 


10. From Question 5, we know that 




' 1 

r 

n 

a 

_ Vn _ 


1 

1 


b 
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Now by Question 7, with A = 


1 r 
1 1 ’ 


A n — k n A — A1A2 k n - 1/2 
= k n A - (1 - r)k n -ih, 

where Ai = 1 + y/r and A 2 = 1 — \]r are the roots of the polynomial 
x 2 — 2x + (1 — r) and 

\n \n 
_ A, - A 2 

h l> — r, r • 

2 y/r 


Hence 


= (k n A - (1 - r)k n -ih) 


k n r 

1 

e 

1 

T— 1 

1 

0 

kn 

L 0 

(1 - r)k n - 1 _ 

(1 - r)k n - 
k n 

1 k n r 1 T a 

k n - (1 - r)k n -i \ [ b _ 


a(k n - (1 - r)k n -i) + bk n r 
ak n + b(k n - (1 - r)k n -\) 


Hence, in view of the fact that 

k n _ A? - A 2 _ A?(l -{£}") 

kn- 1 A?- 1 - Ar 1 A" _1 (l - {y 2 }™ -1 ) ~ * l! aSn ^°°’ 


we have 


a(k n - (1 - r)k n -i) + bk n r 
ak n + b(k n - (1 - r)k n - 1 ) 

“ib + ‘(ib - - r » 

a(Ai - (1 - r)) + 6Air 
aAi + 6(Ai - (1 - ?’)) 

a(y/r + r) + 6(1 + y/r)r 
a(l + y/r) + b(y/r + r) 

Vr{a( 1 + y/r) + 6(1 + y/r) y/r} 

o(l + y/r) + b{y/r + r) 

= y/r. 
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Section 2.7 


1. [A\I 2 ] = 
i?2 — » 


1 4 

1 0 ' 


r i 4 

1 0 ' 

3 1 

0 1 

R 2 — > R 2 + 3i?i 

0 13 

3 1 


1 4 
0 1 


1 0 
3/13 1/13 


R\ — > R\ — 47? 2 


1 0 
0 1 


1/13 

3/13 


Hence A is non-singular and A 1 
Moreover 


1/13 -4/13 
3/13 1/13 


-E'i2(— 4)i?2(l/13)-E , 2i(3)^4 — I 2 , 


— 4/13 
1/13 


so 

A- 1 = E 12 (-4)E 2 (1/13)E 21 (3). 


Hence 

A = {E 21 (3)}- 1 {E 2 (l/13)}- 1 {E 1 2(-4)}- 1 = E 21 (-3)E 2 (13)E 12 (4). 


2. Let D = [ dij ] be an m x m diagonal matrix and let A = [ a,jk ] be an m x n 
matrix. Then 

n 

f RA j jfr / ' dij Vjj k diiCLik, 

3 = 1 

as dij = 0 if i ^ j. It follows that the ith row of DA is obtained by 
multiplying the ith row of A by da. 

Similarly, post-multiplication of a matrix by a diagonal matrix D results 
in a matrix whose columns are those of A. multiplied by the respective 
diagonal elements of D. 

In particular, 

diag (ai, . . . , a n )diag (&i, . . . , b n ) = diag (ai&i, . . . , a n b n ), 

as the left-hand side can be regarded as pre-multiplication of the matrix 
diag (&i, . . . , b n ) by the diagonal matrix diag (a±, . . . , a n ). 

Finally, suppose that each of ai, . . . , a n is non-zero. Then a/ 1 , . . . , a~ l 
all exist and we have 

diag(ai,...,a n )diag(a/ 1 ,...,a“ 1 ) = diag (aia/ 1 , . . . , a n a~ l ) 

= diag (1, . . . , 1) = I n . 

Hence diag (a±, . . . , a n ) is non-singular and its inverse is diag (a/ 1 , . . . , a” 1 ). 
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Next suppose that a* = 0. Then diag (ai, . . . , a n ) is row-equivalent to a 
matix containing a zero row and is hence singular. 



' 0 

0 

2 

1 

0 

0 ' 




" 1 

2 

6 

0 

1 

0 ' 




3. [A|I 3 ] = 

1 

2 

6 

0 

1 

0 

R\ 

7? 2 

0 

0 

2 

1 

0 

0 





_ 3 

7 

9 

0 

0 

1 _ 




_ 3 

7 

9 

0 

0 

1 _ 







' 1 

2 

6 

0 

l 

0 ' 





" 

1 

2 

6 

0 

1 

0 

S 3 

CO 

1 

S3 

CO 

1 

37?i 


0 

0 

2 

1 

0 

0 

R'2 

! * 

7?3 



0 

1 

-9 

0 

-3 

1 




0 

1 

-9 

0 

-3 

1 






0 

0 

2 

1 

0 

0 


i?3 — > 


■ 1 

2 

6 

0 

1 

0 ' 



' 1 

0 

24 

0 

7 

0 

1 

-9 

0 

-3 

1 

R\ - 

7?i — 2 R 2 

0 

1 

-9 

0 

-3 

0 

0 

1 

1/2 

0 

0 



0 

0 

1 

1/2 

0 


7?1 - 
R 2 


Ri — 24i?3 

> R'2 + 9i?3 


1 0 0 -12 7 -2 

0 10 9/2-3 1 

0 0 1 1/2 0 0 


Hence A is non-singular and A 1 = 


-12 

9/2 

1/2 


7 -2 
-3 1 

0 0 


Also 


^23 (9) 71/13 ( 24)i?i2( 2).E/3(l/2)ii/23-E'3i( 3)ii/i2 A — I 3 . 


-2 

1 

0 


Hence 


A 1 — 7?23(9)7?i 3 (-24)^i2(-2)i73(l/2)7^237^3i(-3)7i;i2, 


so 

A = Ei2E3i(3)E23Es(2)Ei2(2)Ei3(24)E23(— 9). 


4. 



' 1 

2 

k ' 


' 1 

2 

k ' 


' 1 

2 

fc ' 

A = 

3 

-1 

1 

-> 

0 

-7 

1 - 3 k 

-> 

0 

-7 

1 - 37c 


5 

3 

-5 


0 

-7 

-5 - 5k 


0 

0 

1 

CM 

1 

<X> 

1 


Hence if —6 — 2k / 0, i.e. if k ^ —3, we see that 7? can be reduced to I 3 
and hence A is non-singular. 


If k = —3, then B 


1 2 -3 

0 -7 10 

0 0 0 


= B and consequently A is singu- 


lar, as it is row-equivalent to a matrix containing a zero row. 
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5. #21 (2) 


r 1 

to 

1 


r 1 2 1 

CM 

1 

-4 


0 

0 


Hence, as in the previous question, 


1 2 
-2 -4 


is singular. 

6 . Starting from the equation A 2 — 2 A + 13/2 = 0, we deduce 

A(A - 2 1 2 ) = —13 1 2 = {A - 2 1 2 )A. 

Hence AB = BA = I 2 , where B = -[j(H — 2 / 2 )- Consequently A is non- 
singular and A~ 4 = B. 

7. We assume the equation A 3 = 3 A 2 — 3 A + I 3 . 

(ii) A 4 = A 3 A = (3 A 2 -3A + I 3 )A = 3 A 3 - 3 A 2 + A 

= 3(3A 2 - 3A + 1 3 ) - 3A 2 + A = 6A 2 - 8A + 3I 3 . 

(iii) A 3 — 3 A 2 + 3 A = I 3 . Hence 

A(A 2 - 3H + 3 / 3 ) =h = ( A 2 - 3H + 3/ 3 M. 

Hence A is non-singular and 


A 


-1 


= A 2 - 3H + 3/3 
-1-3 1 

2 4-1 

0 1 0 


8. (i) If B 3 = 0 then 

(I n -B)(I n + B + B 2 ) = I n (I n + B + B 2 ) - B(I n + B + B 2 ) 

= (I n + B + B 2 )-(B + B 2 + B 3 ) 

= I n - B 3 = I n - 0 = I n . 

Similarly (/„ + B + B 2 )(I n - B) = I n . 

Hence A = I n — B is non-singular and A = I n + B + B 2 . 

It follows that the system AX = b has the unique solution 

X = A~ 4 b = {I n + B + B 2 )b = b + Bb + B 2 b. 


19 



0 r s 0 0 rt 

(ii) Let B = 0 0 t Then B 2 = 0 0 0 and B 3 = 0. Hence 

_ 0 0 0 J [ 0 0 0 _ 

from the preceding question 

(■ h~B r 1 = h + B + B 2 

100 0 r s 00 rt 

= 010 + 00f + 000 

_ 0 0 1 J [ 0 0 0 J [ 0 0 0 _ 

1 r s + rt 

0 1 t 
0 0 1 

9. (i) Suppose that A 2 = 0. Then if A ~ 1 exists, we deduce that A~ l (AA) = 
A~ 1 0, which gives A = 0 and this is a contradiction, as the zero matrix is 
singular. We conclude that A does not have an inverse. 

(ii). Suppose that A 2 = A and that A~ 2 exists. Then 

A~\AA) = A~ 2 A, 

which gives A = I n . Equivalently, if A 2 = A and A ^ J n , then A does not 
have an inverse. 

10. The system of linear equations 

x + y — z = a 
z = b 
2x + y + 2z = c 

is equivalent to the matrix equation AX = B, where 



By Question 7, A 1 exists and hence the system has the unique solution 

—1—3 1 a —a — 3b + c 

X = 2 4-1 b = 2a + Ab — c . 

0 1 0 J [ c J |_ b 

Hence x = —a — 3 b + c, y = 2a + 46 — c, z = b. 
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12. 


Also 


A 


^3(2)A'l4A , 42(3) — £ , 3(2)£’i4 


10 0 0 
0 10 0 
0 0 10 
0 3 0 1 


£s(2) 


0 

3 

0 

1 " 


0 

3 

0 

1 ' 

0 

1 

0 

0 


0 

1 

0 

0 

0 

0 

1 

0 


0 

0 

2 

0 

_ 1 

0 

0 

0 


_ 1 

0 

0 

1 

O 


A" 1 


(^3(2)^14^42(3)) 1 
(i?42(3))- 1 i?r4 1 (^3(2))- 1 
^42 ( 3)£/i4^3(l/2) 


E42(—3)Ei4 


1 0 0 

0 1 0 

0 0 1/2 

0 0 0 


0 

0 

0 

1 


A'42( — 3) 


0 0 0 1 
0 10 0 
001/20 
10 0 0 


0 0 0 1 
0 10 0 
001/20 
1-300 


13. (All matrices in this question are over Z 2 .) 


' 1 

1 

0 

1 

1 

0 

0 

0 ' 


■ 1 

1 

0 

1 

1 

0 

0 

0 

0 

0 

1 

1 

0 

1 

0 

0 


0 

0 

1 

1 

0 

1 

0 

0 

1 

1 

1 

1 

0 

0 

1 

0 


0 

0 

1 

0 

1 

0 

1 

0 

_ 1 

0 

0 

1 

0 

0 

0 

1 _ 


_ 0 

1 

0 

0 

1 

0 

0 

1 

■ 1 

1 

0 

1 

1 

0 

0 

0 ' 


■ 1 

0 

0 

1 

0 

0 

0 

1 

0 

1 

0 

0 

1 

0 

0 

1 


0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

1 

0 

1 

0 


0 

0 

1 

0 

1 

0 

1 

0 

1 

0 

0 

1 

1 

0 

1 

0 

0 _ 


0 

0 

0 

1 

1 

1 

1 

0 


21 



' 1 

0 

0 

0 

1 

1 

1 

1 ' 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

1 

0 

1 

0 

1 

0 

0 

0 

1 

1 

1 

1 

1 

0 


Hence A is non-singular and 


A~ l = 


1111 
10 0 1 
10 10 
1110 


(b) A = 


14. 


(a) 


' 1 

1 

0 

1 ' 



' 1 

1 

0 

1 ' 

0 

1 

1 

1 

i?4 — 

R 4 + R\ 

0 

1 

1 

1 

1 

0 

1 

0 

1 

0 

1 

0 

_ 1 

1 

0 

1 _ 



_ 0 

0 

0 

0 _ 


1 1 1 

-1 1 0 

2 0 0 


R3 — >• R3 ~ R2 


Hence A 1 exists and 


1 0 0 
0 1 0 
0 0 1 


R3 


R\ 

R2 


Ri 


- 2^3 
R\ — R 3 

R2 + R3 
R 3 


' 1 0 

0 

0 

0 

1/2 ' 

0 1 

0 

0 

1 

1/2 

. 0 0 

1 

1 

-1 

-1 _ 

A ~ 1 


' 0 

0 

1/2 ' 

= 

0 

1 

1/2 



1 

-1 

-1 


, so A is singular. 


1 0 0 
0 1 0 
0 1 1 


0 0 1/2 
0 1 1/2 
1 0 - 1/2 


' 2 

2 

4 

1 

0 

0 ' 

R 1 — > Ri — 2i?2 

' 1 

0 

1 

0 

1 

0 ' 

1 

0 

1 

0 

1 

0 

R\ <— > R 2 

0 

1 

0 

0 

0 

1 

0 

1 

0 

0 

0 

1 

i?2 R 3 

0 

2 

2 

1 

-2 

0 


(b) 


R3 — ► R3 ~ 2R2 


' 1 

0 

1 

0 

1 

1 

O 

0 

1 

0 

0 

0 

1 

1 

O 

0 

2 

1 

CN 

1 

1 

Cd 

1 


R -3 — > \R 3 


' 1 

0 

1 

0 

1 

0 ' 

0 

1 

0 

0 

0 

1 

0 

0 

1 

1/2 

-1 

-1 
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' 1 

0 

0 

-1/2 

2 

1 ' 

Ri —y R\— 

0 

1 

0 

0 

0 

1 


0 

0 

1 

1/2 

-1 

-1 


Hence A 1 exists and 


H " 1 


- 1/2 2 1 

0 0 1 

1/2 -1 -1 


' 4 

6 

-3 ' 

r 2 - 

Rs- 

-y hRz 
-4^3 

' 4 

6 

-3 ' 



' 4 

6 

-3 ' 

0 

0 

0 

0 

7 

5 

0 

0 

0 

0 

1 

1 

Rs- 

-y R ?j — R ‘2 

0 

0 

0 

0 

1 

0 


(c) 


Hence A is singular by virtue of the zero row. 



' 2 

0 

0 

1 

0 

0 ' 

Ri- 

\Ri 

' l 

0 

0 

1/2 

0 

0 ' 

(d) 

0 

-5 

0 

0 

1 

0 

R -2 ~ 

► =^R2 
-y \r 3 

0 

1 

0 

0 

-1/5 

0 


0 

0 

7 

0 

0 

1 

7*3- 

0 

0 

1 

0 

0 

1/7 _ 


Hence A 1 exists and A 1 = diag(l/2, —1/5, 1/7). 

(Of course this was also immediate from Question 2.) 


' 1 

2 

4 

6 

1 

0 

0 

0 ' 



' 1 

0 

0 

6 

1 

-2 

0 

0 ' 

0 

1 

2 

0 

0 

1 

0 

0 

Ri- 

-y Ri — 2i?2 

0 

1 

2 

0 

0 

1 

0 

0 

0 

0 

1 

2 

0 

0 

1 

0 

0 

0 

1 

2 

0 

0 

1 

0 

0 

0 

0 

2 

0 

0 

0 

1 



0 

0 

0 

2 

0 

0 

0 

1 


1-2 0 0 
0 1-2 0 

0 0 10 

0 0 0 1 


R\ ^ 

R\ — 3 i ?4 

' 1 

0 

0 

0 

1 

-2 

0 

-3 ' 


i ?2 A 2 i ?4 

0 

1 

0 

0 

0 

1 

-2 

2 

r 3 - 

ftj 

1 

CO 

0 

0 

1 

0 

0 

0 

1 

-1 

i ?4 

> 2 ^4 

0 

0 

0 

1 

0 

0 

0 

1/2 _ 


Hence A 1 exists and 


R 2 R 2 ~ 2i?3 


1 0 0 
0 1 0 
0 0 1 
0 0 0 


6 

-4 

2 

2 


A~ l 


1-2 0-3 

0 1-2 2 

0 0 1-1 

0 0 0 1/2 
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(f) 


1 2 3 

0 -3 -6 
0 0 0 

Hence A is singular by virtue of the zero row. 

15. Suppose that A is non-singular. Then 

AA - 1 = I n = A~ 2 A. 


1 2 3 

4 5 6 

5 7 9 


Hi y 7?2 — 4i?i 
R$ —>■ Rz — 5i?i 


1 2 3 

0 -3 -6 
0 -3 -6 


R3 ► R 3 ~ R'2 


Taking transposes throughout gives 

{AA-y = I t n = (A~ 1 A) t 
(A~ 1 ) t A t = I n = A\A~ l )\ 


so A t is non-singular and (A*) 1 = [A 1 ) t . 


16. Let A 


a b 
c d 


where ad — be = 0. Then the equation 


A 2 — (a + d)A + (ad — be) I 2 = 0 


reduces to A 2 — (a + d)A = 0 and hence A 2 = (a + d)A. From the last 
equation, if A exists, we deduce that A = (a + d)l 2 , or 


a 6 


a + d 

0 

c d 


0 

a + d 


Hence a = a + d, 6 = 0, c = 0, d = a + d and a = b = c = d = 0, which 
contradicts the assumption that A is non-singular. 


17. 


A = 


1 a b 
—a 1 c 
—6 — c 1 


R '2 — > R '2 + oR\ 

R .3 — > R 3 + bR,\ 


R-2 -»• R‘2 


la b 

0 1 + a 2 c + ab 

0 ab — c 1 + b 2 

1 a 

0 1 

0 ab — c 


b 

c-\-ab 

1+a 2 

1 + 6 2 


R 3 — ^ ” R 3 - (ab - c)R 2 


1 

0 

0 


a b 

-1 c-\-ab 

1+a 2 

0 1 + b 2 + A-abUc+ab) 
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Now 


1 + b 2 + 


(c — ab)(c + ab ) 
1 + a 2 


2 c 2 — (ab) 2 
l + b 2 + H v ' 


1 + a 2 + 6 2 + c 2 

1 + a 2 


/0. 


Hence B can be reduced to -Z 3 using four more row operations and conse- 
quently A is non-singular. 

18. The proposition is clearly true when n = 1. So let n > 1 and assume 
(P~ 1 AP) n = P~ 1 A n P. Then 


(P~ l AP) n+1 


(P~ l AP) n (P~ l AP) 
(. P~ 1 A n P)(P~ 1 AP ) 
P~ 1 A n (PP~ 1 )AP 
P~ 1 A n IAP 
P~ 1 (A n A)P 

p~l A n+lp 


and the induction goes through. 

4 -3 
1 1 


We then verify that P 1 AP = 
tion, 


5/12 0 
0 1 


Then from the previous ques- 


19. Let A = 


2/3 

1/4 

and P = 

1 3 

. Then P ” 1 = \ 

1/3 

3/4 


-1 4 



P~ 1 A n P = ( p- l AP) n 


' 5/12 

0 ' 

n 

' (5/12)" 

0 


' (5/12)" 0 ' 

0 

1 


0 

1 " 


0 1 


Hence 


A n 


P 

1 

7 

1 

7 


(5/12) n 

0 


0 ' 

p- 1 = 

1 3 ' 

' (5/12)" 0 ' 

1 

1 

CO 

1 

1 


-1 4 

0 1 

7 

Li 1 J 


(5/12) n 

-(5/12)" 


3 

4 


4 -3 

1 1 


4(5/12)" + 3 (— 3)(5/12)" + 3 

-4(5/12)" + 4 3(5/12)" + 4 


1 

7 


3 3 

4 4 


+ 1 ( 5 / 12 )” 


4 -3 
-4 3 
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Notice that A r 


3 3 

4 4 


as n — > oo. This problem is a special case of 


a more general result about Markov matrices. 
a 6 


20. Let A = , 

c a 

numbers satisfying 


be a matrix whose elements are non-negative real 


a > 0, 6 > 0, c > 0, d > 0, a + c= l = b + d. 


Also let P = 


and suppose that A / I 2 . 


b 1 
c —1 

(i) det P = —b — c = —(6 + c). Now b + c > 0. Also if 6 + c = 0, then we 
would have b = c = 0 and hence d = a = 1, resulting in A = / 2 . Hence 
det P < 0 and P is non-singular. 

Next, 


P _1 AP = 


-1 
b + c 


-1 -1 

— c & 


a 6 
c d 


b 1 
c —1 


-1 

—a — c 

-b-d 

' 6 

1 ' 

6 + c 

—ac + be 

— cb + bd 

c 

-1 

-1 

-1 

-1 

' 6 

1 ' 

6 + c 

— ac + be 

—cb + bd 

c 

-1 

-1 

r 

—b — c 



6 + c 

(— ac + 6c)6 + (—cb + bd)c 

— ac 


Now 


—acb + b 2 c — c 2 b + bdc = — cb{a + c) + bc{b + d ) 

= —cb + bc= 0. 


Also 


— (a + d — l)(b + c) = —ab — ac — db—dc+b + c 

= —ac + 6(1 — a) + c(l — d) — bd 
= — ac + 6c + c6 — bd. 


Hence 


P~ X AP 


' — (6 + c) 

0 


' 1 0 

0 

—(a + d— l)(6 + c) 


0 a + d — 1 
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(ii) We next prove that if we impose the extra restriction that A ^ 
then | a + d — 1| < 1. This will then have the following consequence: 


0 1 
1 0 


A = P 


A" = P 


= P 


0 

a + d — 1 
0 

a + d — 1 
0 

(a + d — 1) 


P 


-l 


P 


-l 


P 


-l 


p 

'10' 
0 0 

p-i 




' b 1 ' 


'10' 

-1 

' -1 -1 ' 


c —1 


0 0 

b + c 

— c b 


-1 
b + c 
-1 
b + c 
1 

b + c 


b 0 
c 0 

-6 - b 
— c — c 

b b 
c c 


-1 -1 

— c b 


where we have used the fact that (a + d — l) n — > 0 as n 
We first prove the inequality | a + d — 1 1 < 1 : 

a + d — 1 < 1 + d— l = d<l 
a + d-1 > 0 + 0 — 1 = — 1. 


oo. 


Next, if a + d — 1 = 1, we have a + d = 2;soa=l = d and hence c = 0 = b, 
contradicting our assumption that A ^ I^- Also if a + d — 1 = —1, then 

a + d = 0;soa = 0 = d and hence c = 1 = b and hence A = ? i • 


22. The system is inconsistent: We work towards reducing the augmented 
matrix: 


' 1 2 

4 

1 1 

5 

lO 

CO 

1 

12 . 


P‘2 d?2 — Rl 

R 3 Ri — 3-Ri 


' 1 

2 

1 

0 

-1 

1 

1 

O 

-1 

1 

O 



' 1 

2 

1 

R 3 ~ > ^3 ~ R‘2 

0 

-1 

1 


0 

0 

-1 
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The last row reveals inconsistency. 

The system in matrix form is AX = B, where 


A = 

'12" 
1 1 

, X = 

X 

, B = 

4 

5 


1 

kO 

CO 

1 


. y _ 


_ 12 _ 


The normal equations are given by the matrix equation 


A 1 AX = A 1 B. 


Now 


A 1 A = 


A f B = 


' 1 

1 

3 ' 


2 

1 

5 


' 1 

1 

3 ' 


2 

1 

5 



1 2 
1 1 

3 5 

4 

5 

12 


= 

' 11 
18 

18 ' 
30 





45 

73 


Hence the normal equations are 

llx+18 y = 45 
18x + 30y = 73. 


These may be solved, for example, by Cramer’s rule: 


x 


y 


45 

18 


73 

30 

36 

11 

18 

" “ 6 

18 

30 


11 

45 


18 

73 

-7 

11 

18 

" ““ IT 

18 

30 




23. Substituting the coordinates of the five points into the parabola equation 
gives the following equations: 

a =0 
a + b + c = 
a + 26 + 4c = 
ci T 3 b T 9c — 
a + 46 + 16c = 


28 


0 

-1 

4 



The associated normal equations are given by 

' 5 10 30 1 [ a 1 [ 11 

10 30 100 b = 42 , 

_ 30 100 354 J [ c J [ 160 _ 

which have the solution a = 1/5, b = —2, c = 1. 

24. Suppose that A is symmetric, i.e. A 1 = A and that AB is defined. Then 

(. B t AB) t = B t A\B t ) t = B f AB, 
so B f AB is also symmetric. 

25. Let Abe m x n and B be n x m, where m > n. Then the homogeneous 
system BX = 0 has a non-trivial solution Xq, as the number of unknowns 
is greater than the number of equations. Then 

(. AB)X 0 = A(BX 0 ) = ^0 = 0 

and the m x m matrix AB is therefore singular, as Xq ^ 0. 

26. (i) Let B be a singular n x n matrix. Then BX = 0 for some non-zero 
column vector X. Then ( AB)X = A(BX) = ^40 = 0 and hence AB is also 
singular. 

(ii) Suppose A is a singular n x n matrix. Then A t is also singular and 
hence by (i) so is B t A t = ( AB )L Consequently AB is also singular 
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Section 3.6 

1. (a) Let S be the set of vectors [x, y] satisfying x = 2 y. Then S' is a vector 
subspace of M 2 . For 

(i) [0, 0] G S as x = 2y holds with x = 0 and y = 0. 

(ii) S is closed under addition. For let [aq, y±] and [aq, 2 / 2 ] belong to S. 
Then x\ = 2y\ and aq = 2 2 / 2 . Hence 

x\ + x 2 = 2yi + 2 y 2 = 2(yi + y 2 ) 

and hence 

[xi + x 2 , yi + 2 / 2 ] = [xi, yi\ + [x 2 , 2 / 2 ] 

belongs to S. 

(iii) S is closed under scalar multiplication. For let [x, y\ G S and t G R. 
Then x = 2y and hence tx = 2(ty). Consequently 

[tx, ty\ = t[x, y] G S. 

(b) Let S be the set of vectors [x, y] satisfying x = 2y and 2x = y. Then S is 
a subspace of M 2 . This can be proved in the same way as (a), or alternatively 
we see that x = 2 y and 2x = y imply x = 4x and hence x = 0 = y. Hence 
S = {[0, 0]}, the set consisting of the zero vector. This is always a subspace. 

(c) Let S be the set of vectors [x, y] satisfying x = 2y + 1. Then S doesn’t 
contain the zero vector and consequently fails to be a vector subspace. 

(d) Let S be the set of vectors [x, y] satisfying xy = 0. Then S is not 
closed under addition of vectors. For example [1, 0] G S and [0, 1] G S, but 
[1, 0] + [0, 1] = [1, 1] 0 5. 

(e) Let S be the set of vectors \x, y\ satisfying x > 0 and y > 0. Then S is 
not closed under scalar multiplication. For example [1,0] G S and — 1 G M, 
but (-1) [1,0] = [-1,0] ^5. 

2. Let X , Y, Z be vectors in M n . Then by Lemma 3.2.1 

{X + Y, X + Z, Y + Z) C (X, Y, Z), 
as each of X + Y, X + Z, Y + Z is a linear combination of X, Y, Z. 
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Also 


X = 1 -(X + Y)+ 1 -(X + Z)-- 2 (Y + Z), 

Y = 1 -(X + Y)- 1 -(X + Z) + ±(Y + Z), 
z = ^(X + Y) + ^(X + Z) + - 2 (Y + Z), 
so 

(X, Y, Z) C (X + y, A + Z, Y + Z). 

Hence 

(X, Y, Z) = (X + Y, X + Z, Y + Z). 


3 . Let Ai = 

' 1 ' 
0 

1 

, *2 = 

' 0 ' 
1 

1 

and A3 = 

' 1 ' 
1 

1 

. We have to decide if 


2 


2 


3 



X\, X‘2- A3 are linearly independent, that is if the equation xX\ + 2/A2 + 
2A3 = 0 has only the trivial solution. This equation is equivalent to the 
folowing homogeneous system 


x + 0y + z = 0 

Ox + y + z = 0 
x + y + z = 0 

2x + 2y + 3z = 0. 

We reduce the coefficient matrix to reduced row-echelon form: 


' 1 

0 

1 ' 


' 1 

0 

0 

0 

1 

1 


0 

1 

0 

1 

1 

1 


0 

0 

1 

1 

to 

2 

1 

CO 


1 

0 

0 

1 

0 


and consequently the system has only the trivial solution x = 0, y = 0, z = 
0 . Hence the given vectors are linearly independent. 

4 . The vectors 


1 

1 


1 

T— 1 

1 

1 


1 — 

T— 1 

1 

1 

-1 

to 

II 

A 

* 

CO 

II 

-1 

1 

T— 1 

1 

1 


1 

T— 1 

1 

1 


1 

1 
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are linearly dependent for precisely those values of A for which the equation 
xX\ + yX 2 + zX 3 = 0 has a non-trivial solution. This equation is equivalent 
to the system of homogeneous equations 

A x — y — z = 0 
—x + A y — z = 0 
—x — y + \z = 0 . 

Now the coefficient determinant of this system is 

A -1 -1 

-1 A -1 = (A + l) 2 (A-2). 

-1 -1 A 

So the values of A which make X\, X- 2 , X 3 linearly independent are those A 
satisfying A / — 1 and A / 2 . 

5. Let A be the following matrix of rationals: 

'll 201 ' 

2 2 5 0 3 

0 0 0 1 3 ' 

_ 8 11 19 0 11 _ 

Then A has reduced row-echelon form 

' 1 0 0 0 -1 ' 

0 10 0 0 
“00101 
0 0 0 1 3 

From B we read off the following: 

(a) The rows of B form a basis for R(A). (Consequently the rows of A 
also form a basis for R(A).) 

(b) The first four columns of A form a basis for C(A). 

(c) To find a basis for N(A), we solve AX = 0 and equivalently BX = 0. 
From B we see that the solution is 

X'l = x 5 

x 2 = 0 
X 3 = -x 5 

X4 = —3x5, 
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with X 5 arbitrary. Then 


x 5 ' 


1 ■ 

0 


0 

-x 5 

= x 5 

-1 

-3x 5 


-3 

x 5 _ 


1 . 


so [1, 0, —1, —3, 1 Y is a basis for N(A). 


6 . In Section 1.6, problem 12, we found that the matrix 


10 10 1 
01011 
' “ 11110 
0 0 110 

has reduced row-echelon form 

' 1 0 0 1 1 
0 10 11 
0 0 110 
0 0 0 0 0 


From B we read off the following: 

(a) The three non-zero rows of B form a basis for R(A). 

(b) The first three columns of A form a basis for C(A). 

(c) To find a basis for N(A), we solve AX = 0 and equivalently BX = 0. 
From B we see that the solution is 


x\ = — X4 — X5 = 24 + X5 
X 2 = —X4 — X5 = X4 + X5 
X3 = —X4 = X4, 


with X 4 and X 5 arbitrary elements of Z 2 . Hence 


" X 4 + X 5 


' 1 " 


■ 1 " 

X 4 + X 5 


1 


1 

X 4 

= X 4 

1 

+ 

0 

X 4 


1 


0 

x 5 


_ 0 _ 


_ 1 _ 


Hence [1, 1, 1, 1, 0]* and [1, 1, 0, 0, 1]* form a basis for N(A). 
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7. Let A be the following matrix over Z 5 : 


"112 0 13" 

2 1 4 0 3 2 

0 0 0 1 3 0 ' 

_ 3 0 2 4 3 2 _ 

We find that A has reduced row-echelon form B: 

" 1 0 0 0 2 4 " 

0 1 0 0 4 4 

0 0 1 0 0 0 ' 

_ 0 0 0 1 3 0 _ 

From B we read off the following: 

(a) The four rows of B form a basis for R(A). (Consequently the rows of 
A also form a basis for R(A). 

(b) The first four columns of A form a basis for C(A). 

(c) To find a basis for N(A), we solve AX = 0 and equivalently BX = 0. 
From B we see that the solution is 

X\ = —2x5 — 4x6 = 3X5 + Xq 

x 2 = -4x 5 - 4x 6 = x 5 + x 6 

x 3 = 0 

x 4 = -3x 5 = 2x 5 , 

where X5 and X6 are arbitrary elements of Z5. Hence 



so [3, 1, 0, 2, 1, 0 ] 4 and [1, 1, 0, 0, 0, l] f form a basis for R(A). 

8 . Let F = {0, 1, a, 6 } be a field and let A be the following matrix over F : 

1 a 6 a 
A = abb 1 
_ 1 ! ! a _ 
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In Section 1.6, problem 17, we found that A had reduced row-echelon form 


B = 


10 0 0 
0 10 6 
0 0 11 


From B we read off the following: 


(a) The rows of B form a basis for R(A). (Consequently the rows of A 
also form a basis for R(A). 

(b) The first three columns of A form a basis for C(A). 

(c) To find a basis for N(A), we solve AX = 0 and equivalently BX = 0. 
From B we see that the solution is 


x\ = 0 

X 2 = —6x4 = 6x4 

X3 = — X4 = X4, 


where X4 is an arbitrary element of F. Hence 


X = X4 


0 

6 

1 

1 


so [0, 6, 1, 1]* is a basis for N{A). 


9. Suppose that X\, . . . , X m form a basis for a subspace S. We have to 
prove that 

X u X 1 + X 2 ,...,X 1 + --- + X m 
also form a basis for S. 

First we prove the independence of the family: Suppose 

xiXi + x 2 (X\ + X 2 ) + • • • + x m (X\ + • • • + X m ) = 0. 


Then 


(xi T x 2 T • • • T x m )~\i + • • • + x rn X rn — 0. 


Then the linear independence of X \ , . . . , X m gives 


xi + x 2 H hx m = 0 ,...,x m = 0 , 
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form which we deduce that x± = 0, . . . , x m = 0. 

Secondly we have to prove that every vector of S is expressible as a linear 
combination of X i , X\ + X 2 , - . . , X\ + • • • + X rn . Suppose X 6 S. Then 


X — Cl 1 X 1 T * * * T dmXfji. 


We have to find aq, . . . , x m such that 

A = x\X\ + X 2 (X\ + X2) + • • • + x m (X{ + • • • + X m ) 
— (x 1 X2 T ■ ■ ■ T x m ) Ai + • • • + x m X m . 


Then 


d\X\ + • • • + a m X m — (xi + X2 + • • • + x m ) X\ + • • • + x m X m . 
So if we can solve the system 

X 1 T X2 T ■ ■ ■ T X m — CLl, . . . ,X m — dm . 1 
we are finished. Clearly these equations have the unique solution 

xi — d! 0 . 2 ) ■ • • j Xm— 1 — d m dm , — 1 7 X m — d m . 


10. Let A 
d = b = c) , 


_ d b 
1 1 

then rank A 


. If [a, b, c ] is a multiple of [1, 1, 1], (that is, 
= 1. For if 


[a, b, c] = t[ 1, 1, 1], 


then 


R(A) = ([a, b , c], [ 1 , 1 , 1 ]) = <t[l, 1 , 1 ], [ 1 , 1 , 1 ]) = ([ 1 , 1 , 1 ]), 
so [1, 1, 1] is a basis for R(A). 

However if [a, b , c] is not a multiple of [1, 1, 1], (that is at least two 
of a, b, c are distinct), then the left-to-right test shows that [a, b , c] and 
[1, 1, 1] are linearly independent and hence form a basis for R(A). Conse- 
quently rankH = 2 in this case. 

11. Let S be a subspace of F n with dim S' = m. Also suppose that 
X\, ... , X rn are vectors in S such that S = (Xi, . . . , X m ) ■ We have to 
prove that X\, ... , X m form a basis for S; in other words, we must prove 
that Xi, ... , X rn are linearly independent. 
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However if X \ , . . . , X m were linearly dependent, then one of these vec- 
tors would be a linear combination of the remaining vectors. Consequently 
S would be spanned by m — 1 vectors. But there exist a family of m lin- 
early independent vectors in S. Then by Theorem 3.3.2, we would have the 
contradiction m < m — 1 . 

12. Let [x, y, z]* € S. Then x + 2y + 3z = 0. Hence x = —2 y — 3z and 


X 

y 

— 

1 

CO 

CM 

1 

1 

= y 

' -2 " 
1 


1 

1 

O CO 

1 

z 


z 


0 


L iJ 


Hence [— 2 , 1 , 0 ] 4 and [—3, 0, 1]* form a basis for S. 

Next (-1) + 2(— 1) + 3(1) = 0, so [-1, -1, 1]* G 5. 

To find a basis for S which includes [—1, —1, 1]*, we note that [— 2 , 1, 0]* 
is not a multiple of [—1, —1, l]d Hence we have found a linearly independent 
family of two vectors in S, a subspace of dimension equal to 2 . Consequently 
these two vectors form a basis for S. 

13. Without loss of generality, suppose that Ad = Ad- Then we have the 
non-trivial dependency relation: 


1-Xi + ( — l)Ad T OA 3 + • • • + 0X m — 0. 


14. (a) Suppose that X m+ i is a linear combination of Ad, . . . , X m . Then 
(X 1 , . . . , X m , X m - |_i) — (X 1 , . . . , X m ) 

and hence 

dim (Ad, • • • , X m , A^ m _(_i) — dim (A^, . . . , X m ). 

(b) Suppose that X m+ i is not a linear combination of Ad, . . . , X rn . If not 
all of X \ , . . . , X m are zero, there will be a subfamily X Cl , . . . , X Cr which is 
a basis for (Ad, . . . , X m ). 

Then as X. m+ \ is not a linear combination of X Cl , . . . , X Cr , it follows that 
X Cl , . . . , X Cr , X m+ 1 are linearly independent. Also 

(Ad, • • • , X rn , X m+ 1 ) — (X C1 , . . . , X . Cr , X m+ \). 

Consequently 

dim (Ad, . . .,X m , X m+1 ) = r + 1 = dim (Ad* • • . , X m ) + 1. 
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Our result can be rephrased in a form suitable for the second part of the 
problem: 

dim (.Xi, • • • j X m , -Xm+ 1) — dim (X^i, • • • , X m ) 
if and only if X m . |_i is a linear combination of X ±, . . . , X m . 

If X = [xi, . . . , XnY, then AX = B is equivalent to 

B — X\A*\ ~b • • • ~b XnA* n . 

So AX = B is soluble for X if and only if B is a linear combination of the 
columns of A, that is B 6 C(A). However by the first part of this question, 
B € C(A ) if and only if dim C{[A\B}) = dimC(A), that is, rank[A|H] = 
rank A. 

15. Let ai, . . . , a n be elements of F. not all zero. Let S denote the set of 
vectors [x i , . . . , where X\, ... ,x n satisfy 


a±xi H b a n x n = 0. 

Then S = N(A), where A is the row matrix [ai, . . . , a n \. Now rank A = 1 
as d / 0. So by the “rank + nullity” theorem, noting that the number of 
columns of A equals n, we have 

dim N(A) = nullity (H) = n — rank A = n — 1. 

16. (a) (Proof of Lemma 3.2.1) Suppose that each of Xi , . . . , X r is a linear 
combination of Y \ , . . . , Y s . Then 

S 

Xi = ^2 OijYj, (1 <i<r). 

3 = 1 

Now let X = j XjXi be a linear combination of X \ , . . . , X r . Then 

X = x\{a\\Y\ H baisY,) 

+ ••• 

~b x r ( Oj'f ] 1 1 ~b ~b n^^Tg) 

= Vi Yi H b y s Y s , 

where yj = a\jX\ +• • • +a r jX r . Hence X is a linear combination of Yi, . . . , Y s . 
Another way of stating Lemma 3.2.1 is 

{X 1 ,...,X r )C{Y n ?'.,Y 8 ), (1) 
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if each of X \ , . . . , X r is a linear combination of Yi, . . . , Y s . 

(b) (Proof of Theorem 3.2.1) Suppose that each of X\, . . . , X r is a linear 
combination of Yi , . . . , Y s and that each of Y \ , . . . , Y s is a linear combination 
of X, .... , X r . Then by (a) equation (1) above 

(X 1 ,...,x r ) c (Ti,...,y s ) 

and 

(li,...,n) c (X 1 ,...,X r ). 

Hence 

<Xi ,...,X r ) = {Y 1 ,...,Y s ). 

(c) (Proof of Corollary 3.2.1) Suppose that each of Z \, . . . , Z t is a linear 
combination of X \ . . . . , X r . Then each of X \ , . . . , X r , Z\, . . . , Zf is a linear 
combination of X \ , . . . , A r . 

Also each of X \, . . . , A r is a linear combination of X \, . . . , A r , Zi, . . . , Zt, 
so by Theorem 3.2.1 

(X u ...,X r , Z u ...,Z t ) = (Xi , . . . , X r ). 

(d) (Proof of Theorem 3.3.2) Let Y\, . . . ,Y S be vectors in {X \, . . . , X r ) 
and assume that s > r. We have to prove that Y\,...,Y S are linearly 
dependent. So we consider the equation 


x\Y\ 4 1- x s Y s = 0. 

Now Yi = '- = j ciijXj, for 1 < i < s. Hence 

x\Y\ H b x s Y s = x'i(anAiH b a\ r X r ) 

+ ■■■ 

~b ^rr(^sl j Ar "b * * * + Cl S rX r ). 

= yiX\ + • • • + y r X r , (1) 

where yj = aij-xq + • • • + a s jX s . However the homogeneous system 

2/1 = 0, • • • , y r = 0 

has a non-trivial solution x\, . . . ,x s , as s > r and from (1), this results in a 
non-trivial solution of the equation 


xiYi H b x s Y s = 0. 
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Hence Yi, ... ,Y S are linearly dependent. 

17. Let R and S be subspaces of F n , with R C S. We first prove 

dim R < dim S. 

Let Xi , . . . , X r be a basis for R. Now by Theorem 3.5.2, because X \, . . . , X r 
form a linearly independent family lying in S, this family can be extended 
to a basis X{, . . . . X r , . . . ,X S for S. Then 

dim S = s > r = dim R. 

Next suppose that dim R = dim S. Let X \ , . . . , X r be a basis for R. Then 
because X \ , . . . , X r form a linearly independent family in S and S is a sub- 
space whose dimension is r, it follows from Theorem 3.4.3 that X \ , . . . , X r 
form a basis for S. Then 


S = {X 1 ,...,X r ) = R. 

18. Suppose that R and S are subspaces of F n with the property that RU S 
is also a subspace of F n . We have to prove that R C S or S C R. We argue 
by contradiction: Suppose that R $2 S and S £ R. Then there exist vectors 
u and v such that 


u € R and v 0 S, v G S and v 0 R. 

Consider the vector u + v. As we are assuming R U S is a subspace, R U S is 
closed under addition. Hence u + v G R U S and so u + v £ R or u + v G S. 
However if u + v G R, then v = (u + v) — u G R, which is a contradiction; 
similarly if u + v G S. 

Hence we have derived a contradiction on the asumption that R S and 
S £ R. Consequently at least one of these must be false. In other words 
R C S or S C R. 

19. Let Xi t i . . , X r be a basis for S. 

(i) First let 


Hi 

= an-Xi + • ■ 

■ * CL\ r f 

Y r 

= a r iAd + • 1 



( 2 ) 
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where A = \a t] \ is non-singular. Then the above system of equations can 
be solved for X\, . . . , X r in terms of Yi, . . . , Y r . Consequently by Theorem 
3.2.1 

(Y u ...,Y r ) = {X 1 ,...,X r ) = S. 

It follows from problem 11 that Y\, ... ,Y r is a basis for S. 

(ii) We show that all bases for S are given by equations 2. So suppose 
that Yi, . . . ,Y r forms a basis for S. Then because X\ , . . . , X r form a basis 
for S, we can express Yi, . . . , Y r in terms of X\ . . . . , X r as in 2, for some 
matrix A = We show A is non-singular by demonstrating that the 

linear independence of Y\, ... ,Y r implies that the rows of A are linearly 
independent. 

So assume 

3-1 [flll ) • • • i Air] T ' ' ' T X r \d r 1 , • • • , flrr] — [0, . . . , 0] . 

Then on equating components, we have 


All^l T * * * T CLr lXr — 0 


C X\j.X\ ~ t - * ' ' Qjy'j.Xx 0. 


Hence 


x \ Y{ T * * * T x r Y r — -)- • • • T ai r X r ') -)- • • • T x r (a r \X\ T ■ ■ ■ T a rr X r ^j 

— (unXi + • • • + CL r \x r ')X\ + • • • + (di r x\ + • • • + a rr x r 'j X r 
= OXi + • • • + ow = o. 


Then the linear independence of Vi , . , V,. implies x\ = 0, . . . , x r = 0. 

(We mention that the last argument is reversible and provides an alter- 
native proof of part (i).) 
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Section 4.1 

1 . We first prove that the area of a triangle P1P2P3, where the points 
are in anti-clockwise orientation, is given by the formula 


f X ! 

X2 

+ X2 

X3 

+ 

Xl 

1 yi 

2/2 

2/2 

2/3 

2/3 

2/1 J 


Referring to the above diagram, we have 

Area Pi P2 P3 = Area OP\ P2 + Area OP2P3 — Area OP\ P3 


Xl 

X2 

1 

+ — 

X2 

X3 

1 

Xl 

®3 

2/1 

2/2 

2 

2/2 

2/3 

2 

2/1 

2/3 


which gives the desired formula. 

We now turn to the area of a quadrilateral. One possible configuration 
occurs when the quadrilateral is convex as in figure (a) below. The interior 
diagonal breaks the quadrilateral into two triangles P1P2P3 and P1P3P4. 
Then 

Area P1P2P3P4 = AreaPiP2P3 + Area P\ P3P4 


f ;C] 

2:2 

+ X2 

X3 

+ X3 

Xl j 

l 2/1 

2/2 

2/2 

2/3 

2/3 

2/1 J 
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Pa Pi 





Xl x 3 

yi 2/3 


X 3 X '4 
2/3 2/4 


X 4 Xl 
2/4 Vl 


1 

2 


XI X2 
2/1 2/2 


X 2 x 3 
2/2 2/3 


X3 X 4 
2/3 2/4 


X4 Xl 

2/4 2/1 


after cancellation. 

Another possible configuration for the quadrilateral occurs when it is not 
convex, as in figure (b). The interior diagonal -P2-P4 then gives two triangles 
P1P2P4 and P2-P3P4 and we can proceed similarly as before. 


2. 



a + x 

b + y 

c + z 


a 

b 

c 


X 

y 

z 

A = 

x + u 

y + v 

z + w 

= 

x + u 

y + v 

z + w 

+ 

X + u 

y + v 

z + w 


u + a 

v + b 

w + c 


u + a 

v + b 

VJ + c 


u + a 

v + b 

VJ + c 


Now 


a 

b 

c 


a 

b 

c 


a 

b 

c 

x + u 

y + v 

z + w 

= 

X 

y 

z 

+ 

u 

V 

w 

u + a 

v + b 

w + c 


u + a 

v + b 

w + c 


u + a 

v + b 

w + c 


a 

b 

c 


a 

b 

C 


a 

b 

c 


a 

b 

c 

X 

y 

z 

+ 

X 

y 

z 

+ 

u 

V 

w 


u 

V 

w 

u 

V 

w 


a 

b 

c 


u 

V 

w 


a 

b 

c 


a b c 
x y z 
u v w 


Similarly 


X 

y 

z 


X 

y 

z 


X 

y 

z 


a 

b 

c 

X + u 

y + v 

z + w 

= 

u 

V 

VJ 

= — 

a 

b 

c 

= 

X 

y 

z 

u + a 

v + b 

w + c 


a 

b 

c 


u 

V 

VJ 


u 

V 

VJ 
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Hence A = 2 


a b c 
x y z 
u v w 



n 2 

(n + l) 2 

(n + 2) 2 

Gi - 

+ c 3 -c 2 

n 2 

2n + 1 

2n + 3 

3. 

(n + l) 2 

(n + 2) 2 

(n + 3) 2 

c 2 - 

- C 2 - C\ 

(n + l) 2 

2n + 3 

2n + 5 


(n + 2) 2 

(n + 3) 2 

(n + 4) 2 


= 

(n + 2) 2 

2n + 5 

2n + 7 


c 3 ^c 3 - c 2 


n 2 2n + 1 2 

(n + 1) 2 2n + 3 2 
(n + 2) 2 2n + 5 2 


R3 — ► R3 — Rz 
R‘2 — ' R‘2 — Ri 


n 2 2n + 1 
2n + 1 2 

2n + 3 2 


2 

0 

0 


= - 8 . 


4. (a) 


246 

427 

327 


246 

100 

327 


246 

1 

327 

1014 

543 

443 


1014 

100 

443 

= 100 

1014 

1 

443 

-342 

721 

621 


-342 

100 

621 


-342 

1 

621 


= 100 


246 1 327 
768 0 116 
-588 0 294 


100 (- 1 ) 


768 116 
-588 294 


-29400000. 


(b) 


1 

2 

3 

4 


1 

2 

3 

4 

-2 

1 - 

-4 

3 


0 

5 

2 

11 

3 

-4 - 

-1 

2 


0 

-10 

-10 

-10 

4 

3 - 

-2 

— 1 


0 

-5 

-14 

-17 

5 

2 


11 



5 

2 

11 

-10 

-10 

- 

-10 


= — 

10 

1 

1 

1 

-5 

-14 

- 

-17 



-5 

-14 

-17 



5 

-3 

6 


10 

1 

0 

0 

= -10(-1) 


-5 

-9 

-12 



900 . 


5. detH 


1 0 

3 1 

-2 

4 

_ 

1 0 

3 1 

0 

10 


1 10 

2 7 

5 2 

-3 


5 2 

7 



-13. 
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Hence A is non-singular and 


A 1 = adj A = 

-13 J -13 


C u C' 2 \ C 3 1 

Cl2 C*22 C32 

C 1 .3 C23 C33 


1 

—13 


-11 -4 2 

29 7 -10 

1 -2 1 


6. (i) 



2 a 

26 

6 — c 

Ri 

— * R\ + R 2 

2a + 26 

26 + 2a 

6 + a 



26 

2 a 

a + c 

26 

2 a 

a + c 



a + 6 

a + 6 

6 



a + 6 

a + 6 

6 




2 

2 


Cl - <?! - 

° 2 (a+b) 

0 

2 

1 

(a+6) 


26 

2a a + c 

2(6 — a) 

2a 

a + c 


a 

+ 6 a 

+ 6 

6 



0 

a + 6 

6 


= 2(a + 6)(a — b ) 


2 1 
a + b b 


= — 2(a + 6)(a — 6) 2 . 


(ii) 


6 + c b 
c c + a 
& a 

c 3 ^c 3 - c\ 


R3 R3 — R2 


c 

a 


C\ -> Cl - c 2 


c b 
—a c + a 


c 

a 


a 

+ 6 | 



1 b - 

-a a a + 6 | 


c 

6 

0 


c 6 0 

—a 

c + a 

2a 

= 2 a 

—a c + a 1 


6 — a 

a 

2a 


6 — a a 1 



c 6 0 



2a 

—a c + a 1 

6 — c 0 

= —2a 

C 0 

b —c 


= 2 a(c 2 + 6 2 ). 


7 . Suppose that the curve y = ax 2 + bx + c passes through the points 
(xi, yi), (x 2 , ?/2), (x 3l y 3 ), where Xi / xj if i / j. Then 

ax\ + tei + c = y\ 
ax\ + bx 2 + c = y 2 
ax\ + bx 3 + c = 2/3. 


The coefficient determinant is essentially a Vandermonde determinant: 


J2 

x 1 

Xi 

1 


*1 

„.2 

x 2 

„.2 

X 3 


1 

1 

1 


J2 

x 2 

3+ 

1 

= 

3+ 

X2 

X 3 

= — 

Xl 

3+ 

3+ 

= -(x 2 -xi)(x 3 -xi)(x 3 -x 2 ) 

™2 

X 3 

®3 

1 


1 

1 

1 


™2 

Xi 

™2 

x 2 

,,.2 

x 3 
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Hence the coefficient determinant is non-zero and by Cramer’s rule, there 
is a unique solution for a, b , c. 


8. Let A = det A = 


1 1 
2 3 
1 k 


-1 

k 

3 


. Then 


A = 


1 0 0 
2 1 k + 2 

1 k- 1 4 

= 4 - (fc - l)(k + 2) = -( k 2 — fc — 6) = — (fc + 3)(fc - 2). 


C 3 

C 2 


Cz + Cv 
C 2 -C x 


1 

fc — 1 


k + 2 
4 


Hence det H = 0 if and only if k = —3 or k = 2. 

Consequently if A; / — 3 and k ^ 2, then det 4^0 and the given system 

x + y — z = 1 

2x + 3y + fcz = 3 

x + ky + 3z =2 

has a unique solution. We consider the cases k = —3 and k = 2 separately. 
k = — 3 : 


M = 


1 1-11 
2 3-3 3 

1-3 3 2 


Ri- 

R 3 


R 2 — 2R± 
> f?3 — R\ 


1 

0 

0 


1 -1 1 
1 -1 1 
-4 4 1 


1 1 
0 1 
0 0 


-1 1 
-1 1 
0 5 


R 3 — * R 3 + 4f?2 

from which we read off inconsistency. 
k = 2: 


AM = 

[ 1 2 3 2 \ U3 ~ 

' 10-50 
— f?2 0 1 4 1 

0 0 0 0 

We read off the complete solution x = 5 z, y = 1 — 4z, where z is arbitrary. 


' 1 1 -1 1 ' 

2 3 2 3 

R 2 - 

■> R 2 — 2R\ 

' l l -l l ' 
0 1 4 1 

12 3 2 

R 3 - 

R 3 — R \ 

0 1 4 1 


46 



Finally we have to determine the solution for which x 2 + y 2 + z 2 is least. 


2 i 2 i 2 

x +y + z 


(5 z) 2 + (1 - 4 zf + z 2 = 42 z 2 - 8z + 1 


42(z 2 z + — ) = 42 

v Ol 40/ 


21 


42' 


2 \ 


z~— +m- 


21 / 


42 



2 


42 


h-4V + ii 

V 21 J 882 


We see that the least value of x 2 +y 2 +z 2 is 42 x and this occurs when 

z = 2/21, with corresponding values x = 10/21 and y = 1 — 4x ^ = 13/21. 


9. Let A 


1 -2 6 
a 0 2 
5 2 0 


be the coefficient determinant of the given system. 


Then expanding along column 2 gives 


A 


a 2 
5 0 



1 b 
a 2 


2ab- 24 = 2{ab- 12). 


-20 - 2(2 - ab) 


Hence A = 0 if and only if ab = 12. Hence if ab ^ 12, the given system has 
a unique solution. 

If ab = 12 we must argue with care: 



' 1 

-2 

6 

3 ' 


' 1 

-2 

6 

3 1 

AM = 

a 

0 

2 

2 

-> 

0 

2a 

2 — ab 2 — 3a 


5 

2 

0 

1 


0 

12 

-56 

-14 1 


' 1 

-2 

6 

3 



" 1 

-2 

6 

3 

0 

1 

—56 


[ 


0 

1 

-56 

-7 

12 

“6 


> 

12 

6 

_ 0 

2a 

2 — ab 

2 - 

3a 


_ 0 

0 

12 — ab 

6 

1 

<3 

-CO 

ri 

-2 

b 

3 1 







Hence if 6 — 2a / 0, i.e. a / 3, the system has no solution. 
If a = 3 (and hence 6 = 4), then 


■ 1 

CM 

1 

3 


' 1 

0 

-2/3 

2/3 " 

0 

1 =5 

3 

-7 


0 

1 

-5 

-7 

6 


3 

6 

0 

0 0 

0 


0 

0 

0 

0 
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Consequently the complete solution of the system is x = | + | z, y = -^ + | z, 
where z is arbitrary. Hence there are infinitely many solutions. 

10. 


1 

1 

2 1 


— > 7?4 — 2i?i 

1 1 

2 1 

1 

2 

3 4 

A3 

— ► 7?3 — 2i?i 

0 1 

1 3 

2 

4 

7 2f + 6 

i?2 * 7?2 — 72 1 

0 2 

3 2f + 4 

2 

2 

6 — t t 



0 0 

2-t t-2 

1 


1 3 1 


1 

1 

3 

2 


3 2t + 4 2 ^ 

i?2 — 2i?i q 

1 

2t — 2 

0 

2 

-t t- 2 


0 

2-t 

t-2 


1 

2t - 2 


1 2t - 2 



2 

- f 

t — 2 =<*- 

-2) 

-1 1 

= {t~ 

2)(2t — 1). 


Hence A = 0 if and only if t = 2 or t = . Consequently the given matrix 

B is non-singular if and only if t ^ 2 and t ^ 

11. Let A be a 3 x 3 matrix with det d/0. Then 

(i) 

A ad] A = (det A)I^ (1) 

(det A) det ( adj A) = det (det A ■ I3) = (det A) 3 . 

Hence, as det A / 0, dividing out by det A in the last equation gives 

det ( adj A) = (det A) 2 . 

(ii) . Also from equation (1) 

(dib' 4 ) adjA=/3 ’ 

so adj A is non-singular and 

(adjA) " 1 = di u A - 

Finally 

A -1 adj (A -1 ) = (det A -1 )/ 3 

and multiplying both sides of the last equation by A gives 
adj (A^ 1 ) = A(det A _1 )/ 3 = 
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12. Let A be a real 3x3 matrix satisfying A 1 A = I 3 . Then 


(i) A\A - h) = A t A -A t = I 3 -A t 

= -( A t -I 3 ) = -(A t -I t 3 ) = -(A-I 3 ) t . 

Taking determinants of both sides then gives 

det A t det (A — I 3 ) = det (— (A — I 3 ) 1 ) 
det A det {A — I 3 ) = (— l) 3 det (A — I?) 1 

= — det (A — I 3 ) (1). 

(ii) Also det A A* = det I 3 , so 

det A 1 det A = 1 = (det A) 2 . 


Hence det A = ±1. 

(iii) Suppose that det A = 1. Then equation (1) gives 

det ( A — I 3 ) = — det {A — I 3 ), 

so (1 + 1) det (A — I 3 ) = 0 and hence det (A — I 3 ) = 0. 

13. Suppose that column 1 is a linear combination of the remaining columns: 


A*! — X 2 A *2 + • • • + x n A* n . 


Then 


det A = 


^2^12 + • • • + x n ai n a 12 
3^22 + • • • + x n a 2 n an 


ain 

«2n 


X 2 an 2 T ■ ■ ■ T x n a nn a n 2 ■ ■ ■ a nn 


Now det A is unchanged in value if we perform the operation 


Cl - 

,Ci- 

X2C2 - ■ 

• • — x. 



0 

a\2 • • • 

n 

det A 

= 

0 

022 • • • 

U 2 n 



0 

a n 2 ■ ■ ■ 

Q"nn 
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Conversely, suppose that det H = 0. Then the homogeneous system AX = 0 
has a non-trivial solution X = [aq, . . . , x n ]L So 

xiA*i • • • T XfiA^yi — 0. 


Suppose for example that x\ / 0. Then 



and the first column of A is a linear combination of the remaining columns. 
14. Consider the system 


—2x + 3y — z = 1 

x + 2y — z = 4 

— 2x — y + z = —3 


Let A = 


CM 

1 

3 

-1 


o 

7 

1 

CO 


7 -3 

1 

2 

-1 

= 

1 

2 

-1 

1 

II 

3 -1 

CN 

1 

-1 

1 


0 

3 

-1 



= -2 / 0 . 


Hence the system has a unique solution which can be calculated using 
Cramer’s rule: 


where 


-4 

—2 



Ai 


A 2 


A 3 

x - 

A : 

. y = 

: X’ 

Z 

“ A ’ 



1 

3 

-1 


Ai 

= 

4 

2 

-1 

= -4, 



-3 

-1 

1 




-2 

1 

-1 


A 2 

= 

1 

4 

-1 

= -6, 



-2 

-3 

1 




-2 

3 

1 


A 3 

= 

1 

2 

4 

= -8. 



-2 

-1 

-3 


y = : 

=§ = 3 > 2 = 

-8 

-2 

= 4. 



15. In Remark 4.0.4, take A = I n . Then we deduce 


(a) det Eij = — 1; 

(b) det Ei(t) = t; 
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(c) det Eij(t) = 1. 


Now suppose that B is a non-singular nx n matrix. Then we know that B 
is a product of elementary row matrices: 

B = Ei ■ ■ ■ E m . 

Consequently we have to prove that 

det E\ ■ ■ ■ E m A = det E\ ■ ■ ■ E m det A. 


We prove this by induction on m. 

First the case m = 1. We have to prove det E\A = det E\ det A if E\ is 
an elementary row matrix. This follows form Remark 4.0.4: 

(a) det EijA = — det A = det E tJ det A ; 

(b) det Ej (t)A = t det A = det Ej(t) det A; 

(c) det Eij{t)A = det A = det Eij(t ) det A. 

Let rn > 1 and assume the proposition holds for products of m elementary 
row matrices. Then 


det E\ • • • E m E m+ iA 


det (Ei ■ ■ ■ E m )(E m+l A) 
det (Ei--- E m ) det (E m+ iA) 
det (Ei ■ ■ ■ E m ) det E m+ \ det A 
det ((Ei ■ ■ ■ E m )E m+ 1 ) det A 


and the induction goes through. 

Hence det BA = det B det A if B is non-singular. 

If B is singular, problem 26, Chapter 2.7 tells us that BA is also singlular. 
However singular matrices have zero determinant, so 


det B = 0 det BA = 0, 


so the equation det BA = det B det A holds trivially in this case. 


16. 


a + b + c a + b 
a + b a + b + c 
a a 

a a 


a a 

a a 

a + b + c a + b 
a + b a + b + c 
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R\ - 

i?i 

-i?2 

c 

— c 

0 

0 

r 2 - 

■4 i?2 

- R:>, 

6 

6 + c 

— 6 — c 

-6 

Ri - 

i? 3 

— r 4 

0 

0 

c 

— c 


= 


a 

a 

a + 6 

a + 6 + c 


C 2 -> C 2 + C 1 


c 3 -> c 3 + c 2 


c 0 0 0 

6 2 b + c —b — c —b 
0 0 c — c 

a 2 a a + 6 a + 6 + c 


= c 


26 + c — 6 — c —6 

0 c — c 

2a a + 6 a + 6 + c 


26 + c 

— 6 — c 

—26 — c 

2 

26 + c 

-26- c 

0 

2 a 

c 

a + 6 

0 

2a + 26 + c 

= <r 

2 a 

2a + 26 + c 


= c 2 (26 + c) 


1 -1 
2a 2a + 26 + c 


= c 2 (26 + c)(4a + 26 + c). 


17. Let A 


1 + ai 

Ul 

Ul 

Ul 

u 2 

l+u 2 

U 2 

U 2 

u 3 

U 3 

1 + ^3 

U 3 

u 4 

u 4 

1X4 

1 + Xi4 


. Then using the operation 


Ri — > R\ + i?2 + 7?3 + R 4 


we have 


A = 


t t t t. 

u 2 1 + u 2 u 2 u 2 

u 3 u 3 1 + u 3 u 3 

U 4 U 4 U 4 1 + U 4 


(where t = 1 + u\ + u 2 + u 3 + 114 ) 


= (1 + Ul + U 2 + U 3 + U 4 ) 


The last determinant equals 


C 2 ^C 2 

C 3 ^C 3 

c 4 -> c 4 


Cl 

Cl 

Cl 


1 

1 

1 

1 

^2 

1 + «2 

U 2 

U 2 

^3 

u 3 

1+^3 

U 3 

u 4 

u 4 

U 4 

1 + XX4 


1 

0 

0 

0 



1 

0 

0 

= 1 

«3 

0 

1 

0 

1X4 

0 

0 

1 
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18. Suppose that A 4 = —A, that A e M nxn (F), where n is odd. Then 


detH 4 = det(— A) 
detH = (— l) n det A = — det A. 

Hence (1 + 1) det A = 0 and consequently det A = 0 if 1 + 1 =/=■ 0 in F. 
19. 


1 

i 

1 

1 

c 4 - 

■+ c 4 

-C's 

1 

0 

0 

0 

r 

i 

1 

1 

c 3 - 

- C's 

-c 2 

r 

1 — r 

0 

0 

r 

r 

1 

1 

c 2 - 

-+c 2 

-Cl 

r 

0 

1 — r 

0 

r 

r 

r 1 


= 


r 

0 

0 

1 — r 


20. 


1 a 2 — be a 4 

lh - 

-* i?2 — Rl 

1 a 2 — be a 4 

1 b 2 — ca b 4 

1 c 2 — ab c 4 

R*~ 

R 3 — Ri 

0 b 2 — ca — a 2 + be b 4 — a 4 

0 c 2 — ab — a 2 + be c 4 — a 4 


Finally 


b 2 — ca — a 2 + be b 4 — a 4 
c 2 — ab — cl 2 + be c 4 — a 4 


(b — a)(b + a) + c(b — a) (b — a)(b + a){b 2 + a 2 ) 
(c — a)(c + a) + b(c — a) (c — a)(c + a)(c 2 + a 2 ) 

(b — a)(b + a + c) (6 — a) (6 + a)(6 2 + a 2 ) 

(c — a)(c + a + 6) (c — a)(c + a)(c 2 + a 2 ) 


(b — a)(c — a) 


6 + a + c (6 + a)(6 2 + a 2 ) 
c+a + 6 (c + a)(c 2 + a 2 ) 


(6 — a)(c 


a) (a + 6 + c) 


1 (6 + a){b 2 + a 2 ) 

1 (c + a)(c 2 + a 2 ) 


1 (b + a)(b 2 + a 2 ) 
1 (c + a)(c 2 + a 2 ) 


(c 3 + ac 2 + ca 2 + a 3 ) — (6 3 + ab 2 + ba 2 + a 3 ) 

(c 3 - b 3 ) + a(c 2 - b 2 ) + a 2 (c - 6) 

(c — 6)(c 2 + c6 + b 2 + a(c + b) + a 2 ) 

(c — b)(c 2 + cb + bi 2 + ac + ab + a 2 ). 
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1. 


Section 5.8 


(i) (—3 + z) (14 — 2*) 


(-3)(14-2*)+*(14-2*) 

{(-3)14 - (— 3)(2*)} + *(14) - *(2t) 
(-42 + 6i) + (14* + 2) = -40 + 20*. 


2 + 3* _ (2 + 3*)(1 + 4*) 

1-4* ~ (1 -4*)(l + 4*) 

((2 + 3*) + (2 + 3*) (4*) 
l 2 + 4 2 

-10 + 11 * -10 11 

= = 1 * 

17 17 17 


(m) (1 + 2*) 2 = l + 4* + ( 2*) 2 

^ Ul) 1 — * “ 1 — * 

1 + 4* - 4 _ -3 + 4* 

1 — * 1 — * 

(-3 + 4*)(l + *) _ -7 + * _ 7 | 1 . 

“ 2 ~ 2 _ ~2 + 2 * 


2. (i) 

iz + (2 — 10*) 2 = 3^ + 2* z(i + 2 — 10* — 3) = 2* 

/ n -2* 

=44> z(— 1 — 9*) = 2* 2 = j — — 

— 2*(1 - 9*) _ -18 - 2* _ -9 - * 
1 + 81 “ 82 “ 41 


(ii) The coefficient determinant is 


1 + * 2 — * 
1 + 2* 3 + * 


(1 + *)(3 + *) - (2 - *)(1 + 2*) = -2 + * ± 0. 


Hence Cramer’s rule applies: there is a unique solution given by 


2 


W 


—3* 2 — * 

2 + 2* 3 + * 
—2 + * 

1 + * —3* 

1 + 2 * 2 + 2 * 
—2 + * 


-3 - 11* 
— 2 + * 

—6 + 7* 
— 2 + * 


-1 + 5* 


19 - 8* 
5 
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3. 


1 + (1 + z) H h (1 + i) 99 


Now (1 + i ) 2 = 2 i. Hence 


(1 + x ) 100 - 1 

( 1 + 0 M °-i = _ <{(1 + <r _ 1} . 


(1 + i) 1 ™ = (2t ) 50 = 2 50 t 50 = 2 5 »(-1) 25 = -2 5 “. 


Hence -z {(1 + i) 100 - l} = -z(- 2 50 - 1) = (2 50 + 1 )i. 

4. (i) Let z 2 = —8 — 6 i and write z=x+iy, where x and y are real. Then 

z 2 = x 2 — y 2 + 2 xyi = — 8 — 6z, 


so x 2 — y 1 = —8 and 2 xy = —6. Hence 



so .T 4 + 8x 2 — 9 = 0. This is a quadratic in x 2 . Hence x 2 = 1 or —9 and 
consequently x 2 = 1. Hence x = 1, ?/ = -3 or x = -1 and y = 3. Hence 
z = 1 — 3i or z = — 1 + 3 i. 

(ii) z 2 — (3 + i)z + 4 + 3i = 0 has the solutions z = (3 + i ± d)/2, where d is 
any complex number satisfying 

d 2 = (3 + i) 2 — 4(4 + 3z) = — 8 — 6 i. 


Hence by part (i) we can take d = 1 — 3z. Consequently 


3 + z±(l-3z) 0 . 

2 = — = 2 — i 


or 


1 + 2 i. 


V 


(i) The number lies in the first quadrant of 

the complex plane. 

|4 + z| = \J 4 2 + l 2 = Vl7. 


4 + i 

. x 


Also Arg (4 + i) = a, where tan a = 1/4 

and 0 < a < 7r/2. Hence a = tan — 1 (1/ 4) . 
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(ii) The number lies in the third quadrant of 
the complex plane. 


—3 -i 
2 


-3-z 

2 




a/10 

~Y~ 


y 



X 


Also Arg ( ~ = — 7T + a, where tan a = 

\l\ = 1/3 and 0 < a < zr/2. Hence a = 
tan — 1 (1/3) . 

(iii) The number lies in the second quadrant of 
the complex plane. 

| - 1 + 2i| = 1) 2 + 2 2 = VE. 

Also Arg(— l + 2z) = 7r — a, where tan a = 
2 and 0 < a < 7t/ 2. Hence a = tan _1 2. 


(iv) The number lies in the second quadrant of 
the complex plane. 


-1 + 2 z 



-l + n/3 _ | - l + n/3| 

2 _ 2 

= ^\/(-l) 2 + (V3) 2 = ^vT+3 = 1. 

Also Arg ( 4 / + ^z) = 7r — a, where 
tana = ^/| = \/3 and 0 < a < 7 t/2 . 
Hence a = 7 t/3. 


6. (i) Let z = (1 + z)(l + \Z3*)(\/3 — z). Then 
|z| = |1 + * j 1 1 + \/3z|| \/3 — z| 

= \/l 2 + l 2 \/l 2 + (V3) 2 \/(V3) 2 + (-1) 2 

= v / 2v / 4v / 4 = 4\/2. 


Arg 2 = Arg (1 + z) + Arg (1 + a/3) + Arg (a/3 — z) (mod 27 t) 
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_ 7T 7 T 7T 5 

= 4 + 3 _ 6 = 12' 

Hence Arg z = A and the polar decomposition of z is 


z = 4\/2 


57T 57T 

c ° s _ +!sm _ 


(ii) Let z 


(l+i) 5 (l-»\/3) 5 
( a / 3 +*) 4 


. Then 


|(l + i)| 5 |(l-n/3)| 5 = (V2) 5 2° = 2?/2 
|(i/3 + i)| 4 2“ 


Arg z 


Arg (1 + i ) 5 + Arg (1 — VSi) 5 — Arg (a/ 3 + z) 4 
5Arg (1 + z) + 5Arg (1 — VSi) — 4Arg (\/3 + z) 


7 r 

5— + 5 
4 




-13? r 
12 


1 1 7T 
12 " 


(mod 27 t) 


Hence Arg z = ^ry and the polar decomposition of z is 


z = 2 7 / 2 


117T ll7r\ 

cos _ +isin _j 


7. (i) Let z = 2(cos j + zsin |) and w = 3(cos | + zsin |). (Both of these 
numbers are already in polar form.) 

(a) zw = 6(cos(f + f) + zsin(f + |)) 

= 6(cos y> + *sin if )• 

( b ) w = K cos (l - f) + *sin(^ - |)) 

= |(cos + zsin 2L). 

( c ) 7 = l(cos(|-i) + *sm(f-f)) 

= l(c°s(=f) + zsin(=f)). 

(d) ^ = |f(cos(^-f) + zsin(f -f)) 

= f (cos^ Aisin^f). 

(a) (1 + i ) 2 = 2z, so 

(1 + *) 12 = (2 z) 6 = 2 6 z 6 = 64(z 2 ) 3 = 64(— l) 3 = -64. 
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— i, so 
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Geometrically, the solutions lie equi-spaced on the unit circle at arguments 


7T 7T 7T 57T7T 9l f f 137T 

8’ 8 + 2 _ Y’ 8 +7r_ T’ 8 +3 2 

Also Z 2 = — zq and Z 3 = —z\. 

(iii) To solve the equation z 3 = — 8 i, we rewrite the equation as 

3 

= 1. 


Then 


-2 i 


= 1, 


-2 i / 

-1 + y/3 i 


or 


-1 - v/3 i 


Hence z = —2 i, + i or — \/3 + i. 

Geometrically, the solutions lie equi-spaced on the circle \z\ = 2, at 
arguments 

7T 7T 27T 7T 27T 37T 

6’ 6 + Y _ "fT’ 6 + 3 _ T' 


(iv) To solve z 4 = 2 — 2i, we write 2 — 2i in modulus-argument form: 


2-2 * = 2 3/2 


/ — 7T — 7r\ 

cos — ■ — b 1 sin — — . 
V 4 4 / 


Hence the solutions are 




2 3 / 8 


cos 


Y + 2 kn 


1 sm 


Y + 2A:7r\ 

^ J' 


k = 0, 1, 2, 3. 


We see the solutions can also be written as 


Zk 


2 3/ V ( cos ^_ isin i). 


Geometrically, the solutions lie equi-spaced on the circle \z\ = 2 3 / 8 , at ar- 
guments 


— 7 r — 7T 7 T 77T — 7T 7T 157T — 7T 7T 237T 

~16 i "l6~ + 2 “ 16’ l6" + 2 2 _ "16”’ Iff +3 2 “ ”16"' 

Also Z 2 = —zq and Z 3 = —z\. 


59 



9. 


2 + i 

-1 + 2* 

2 

1 + i 

-1 + i 

1 

1 + 2* 

— 2 + i 

1 + i 

r 2 ^ 

R'2 (1 + 

*)R\ 


R\ —■ ► R\ — R-2 
R 3 ^ R 3 — R-2 


1 i 1 

1 + i — 1 + i 1 
i — 1 i 


R 3 — ► -R 3 — i-Ri 

-Ri — ► Ri — R2 


' l 

i 

1 1 



' 1 

i 

1 ' 

0 

0 -i 

Ra~ 

->• iii 2 

0 

0 

1 

0 

0 

0 



0 

0 

0 


1 * 0 

0 0 1 

0 0 0 


The last matrix is in reduced row-echelon form. 

10. (i) Let p = l + im and z = x + iy. Then 

pz + pz = (l — im)(x + iy) + (l + im)(x — iy) 

= (lx + liy — imx + my) + (lx — liy + imx + my) 

= 2 (Ix + my). 

Hence pz + pz = 2n lx + my = n. 

(ii) Let w be the complex number which results from reflecting the com- 
plex number z in the line lx + my = n. Then because p is perpendicular to 
the given line, we have 


w — z = tp, t £ 


(a) 


Also the midpoint °f the segment joining w and z lies on the given line, 
so 


w + z 
2 

w + z 


+ p 
+ p 


w + z 

2 

w + z 


= n, 


= n. 


(b) 


Taking conjugates of equation (a) gives 

w — z = tp. (c) 

Then substituting in (b), using (a) and (c), gives 


_ f 2 w — tp 
V 


+ P 


2 z + tp 


= n 
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and hence 


pw + pz = n. 


(iii) Let p = b — a and n = \b \ 2 — \a\ 2 . Then 


\z — a\ = \z — b | 
(z — a)(z — a) 
(z — a)(z — a) 
zz — az — za + aa 
(b — a)z + (b — a)z 
<5pz + pz 


<=> \z - a \ 2 = \z- b \ 2 
= {z-b){—b) 

= (z-b)(z-b) 

= zz — bz — zb + bb 
= \b\ 2 ~\a \ 2 


Suppose z lies on the circle | 
line pz + pz = n. Then by part (ii) 


a 

b 


and let w be the reflection of 2 in the 


pw + pz = n. 


Taking conjugates gives pw + pz = n and hence 

n — pw 


z = 


P 


(a) 


Substituting for z in the circle equation, using (a) gives 


However 


n—pw 



p a 


n — pw — pa 

n—pw j 

p 


n — pw — pb 


(■ b ) 


n — pa 


\b \ 2 — \a \ 2 — (b — a) a 
bb — aa — ba + aa 
b(b — a) = bp. 


Similarly n — pb = ap. Consequently (b) simplifies to 


bp 

— pw 


b — w 


w — b 

ap 

— pw 


a — w 


w — a 


which gives 


1 

A' 
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11. Let a and b be distinct complex numbers and 0 < a < it. 

(i) When z i lies on the circular arc shown, it subtends a constant angle 
a. This angle is given by Arg (zi — a) — Arg ( z\ — b). However 


Arg 


z\ — a 


z\ — b 


= Arg (z i — a) — Arg (z± — b) + 2kn 
= a + 2kiT. 

It follows that k = 0, as 0 < a < it and — ir < Arg 9 < i r. Hence 

' zi — a' 


Arg 


= a. 


z\ — b / 

Similarly if Z 2 lies on the circular arc shown, then 

' Z2 - a ' 


Arg 


Z2~b 


= — = — (ir — a) = a — ir. 


Replacing a by it — a, we deduce that if Z4 lies on the circular arc shown, 
then 

/ N 

Z 4 — a 


Arg 


= it — a, 


Z 4 — b 

while if Z 3 lies on the circular arc shown, then 


Arg 


Z3- a 


= —a. 


A3 ~b. 

The straight line through a and b has the equation 

z = (1 — t)a + tb , 
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where t is real. Then 0 < t < 1 describes the segment ab. Also 

z — a t 


z — b t — 1 


Hence is real and negative if z is on the segment a, but is real and 
positive if z is on the remaining part of the line, with corresponding values 

f z — a\ 

Arg \ J = *’ °’ 

respectively. 

(ii) Case (a) Suppose z i, z 2 and Z3 are not collinear. Then these points 
determine a circle. Now z 1 and 2 2 partition this circle into two arcs. If z 3 
and Z4 lie on the same arc, then 

Arg = Arg ( ; 

\Z3-Z2J \Z4-Z2J 

whereas if z 3 and Z4 lie on opposite arcs, then 


and 


Hence in both cases 


Arg 


f z 3 - z A 
\ Z3 - z 2 J 


= a 


Arg 


Z4 ~ zi 
Z4 - Z 2 


= a — it. 


Arg ( 

\z3-z2 Z4-Z2J 


Arg ( — — Arg ( — (mod 2 n) 

\Z3-Z2J \Z4-Z2J 

0 or 7 r. 


In other words, the cross-ratio 

z 3 - Z\ Z4 - z 1 
z 3 z 2 Z4 — Z2 

is real. 

(b) If z 1, z 2 and z 3 are collinear, then again the cross-ratio is real. 

The argument is reversible. 

(iii) Assume that A, B, C, D are distinct points such that the cross-ratio 


^3 - Zl Z 4 ~ Zl 

r = / 

z 3 - z 2 Z 4 - Z2 

is real. Now r cannot be 0 or 1 . Then there are three cases: 
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(i) 0 < r < 1; 

(ii) r < 0; 

(iii) r > 1. 

Case (i). Here \r\ + |1 — r\ = 1. So 

-Z4 - Zi Z 3 - Z 2 1 _ f Z 4 - Z! Z 3 - Z 2 \ ^ 

Z4- Z 2 Z 3 — Z\ \Z4- z 2 z 3 - ZiJ 

Multiplying both sides by the denominator \z± — z 2 \\z 3 — zi\ gives after 
simplification 

\z 4 - Zl\\z 3 - Z 2 \ + jz 2 - Zl\\z 4 - Z 3 \ = I Z 4 - z 2 \\z 3 - Zl\, 
or 

(a) AD ■ BC + AB ■ CD = BD ■ AC. 

Case (ii). Here 1 + |r| = |1 — r|. This leads to the equation 

(b) BD ■ AC + AD • BC+ = AB ■ CD. 

Case (iii). Here 1 + |1 — r| = |r|. This leads to the equation 

(c) BD • AC + AB ■ CD = AD ■ BC. 

Conversely if (a), (b) or (c) hold, then we can reverse the argument to deduce 
that r is a complex number satisfying one of the equations 

|r| + |1 — r\ = 1, l + |r| = |l — r|, 1 + |1 — r| = |r|, 

from which we deduce that r is real. 
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Section 6.3 


1. Let A = 


Then A has characteristic equation A 2 — 4A + 3 = 0 


4 -3 
1 0 

or (A — 3) (A — 1) = 0. Hence the eigenvalues of A are Ai = 3 and A 2 = 1. 
Ai = 3. The corresponding eigenvectors satisfy (A — XiI 2 )X = 0, or 


' 1 

1 

CO 

1 


1 

0 

1 

CO 

1 


[0 J 


or equivalently x — 3y = 0. Hence 


X 


' 3 y ' 


' 3 ' 

. y _ 


y 

— y 

1 


3 

and we take X\ = 

Similarly for A 2 = 1 we find the eigenvector X 2 = 


Hence if P = [Af,|X 2 ] = 


3 1 
1 1 


then P is non-singular and 


P-'AP = 


3 0 
0 1 


Hence 

and consequently 

A n 


A = P 


3 0 
0 1 


j-i 


= P 


3 n 0 

0 r 


p 


-1 


r. 

3 11 

- gn 

0 

1 


[1 ij 

0 

l n 

2 

- 

1 

■ gn+l 

1 ' 

1 

— 

1 ' 

2 

3” 

1 

-1 


3 


-1 

3 


1 r 3 n+1 - 1 — 3 n+1 + 3 

3 n - 1 -3" + 3 

3 n — 1 3 - 3 n r 

A H 1 2 . 
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2. Let A = 


3/5 4/5 


U J. 1 _ /_ -1 / r * J- XXV.XX VV Vv J.X±X\_X UIlUil 

L 2 / 5 V 5 J 

A 2 = —1/5, with corresponding eigenvectors 


Then we find that the eigenvalues are Ai = 1 and 


X! = 


and X 2 = 


Then if P = [X 1 IX 2 ], P is non-singular and 


P~ X AP = 


1 0 
0 -1/5 


and A = P 


1 0 
0 -1/5 


Hence 


P 1 0 p -i 

[ 0 (— l/5) n J 

p\ 1 °1p-i 
0 0 

' 2 -1 1 r 1 0 1 1 r 11 
1 1 J [ 0 0 J 3 [ -1 2 

1 r 2 0 1 r 11' 

310 -12 


1 r 2 2 
3 1 1 


2/3 2/3 
1/3 1/3 


3. The given system of differential equations is equivalent to X = AX, 
where 

. [3—2] 1 v \ x 

A = . and X = 

[ 5 -4 J L y J 

[2 1 ] 

The matrix P = is a non-singular matrix of eigenvectors corre- 

o 1 

sponding to eigenvalues Ai = —2 and A 2 = 1. Then 


P~ l AP = 


-2 0 


The substitution X = PY, where Y = [xi, yi} 1 , gives 


Y = ~ 2 ° Y 

0 1 ’ 
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or equivalently x’i = — 2x\ and ij\ = y\ . 

Hence x\ = xi(0)e~ 2t and y\ = yi(0)e t . To determine xi(0) and y\ (0), 
we note that 


xi(0) 

= P” 1 

x(0) 

1 

1 -1 ' 

' 13 ' 


' 3 ' 

. 2/i(0) _ 


. 2/(0) _ 

~~ 3 

-5 2 

22 


_ 7 _ 


Hence x\ = 3e 2t and y\ = 7eL Consequently 

x = 2x'i + 2/1 = 6e~ 2f + 7e* and y = 5x i + yi = 15e -2 * + 7e*. 


4. Introducing the vector X n = 


Vn 


the system of recurrence relations 


'J'n+i — 3x n y n 
Vn+ 1 — %n T 3y n , 


becomes X n +i = AX n , where A = 
X 0 = 


3 -1 
-1 3 


. Hence X n = A n X o, where 


To find A n we can use the eigenvalue method. We get 
A n = 


2 n + 4 n 

qn 


2 n q n 

2 n + 4 n 


Hence 


X n = - 


1 

2 71 + 4 n 2 n 4 n 

■ 1 ' 

2 

2^i ^ n 2 n _(_ 4 n 

2 


2 n + 4 n + 2(2 n - 4 n ) 
2 n - 4 n + 2(2 n + 4 n ) 


1 

' 3 x 2 n - 4" ' 


' (3 x 2 n — 4 n ) /2 ' 

2 

3 x 2 n + 4" 


(3 x 2 n + 4 n )/2 


Hence x n = ^(3 x 2 n — 4 n ) and y n = j(3x 2 n + 4 n ). 


5. Let A = 


a 6 
c d 


be a real or complex matrix with distinct eigenvalues 
Ai, A 2 and corresponding eigenvectors X\, X 2 . Also let P = [Xi | AT 2 ] . 

(a) The system of recurrence relations 

= ax n + by n 
= cx n + dy n 


2 - 71+1 
Un + 1 
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has the solution 


where 



= A n 

x 0 

= (p 

Ai 

0 

\ n r 

p - 1 ) 

_ Vn . 


. yo . 

V 

0 

a 2 

) V 


= P 


A" 0 

o xr 2 


P ~i 


x 0 

yo 


= [Xi|x 2 ] 


A? 

0 


0 

\n 

A 2 



= [Xi\X2] 


A ?a 

AJ/9 


A?aXi + A^/3X 2 , 


a 

/? 


= P” 1 


x 0 

yo 


(b) In matrix form, the system is X = AX, where X = 
X = PY, where Y = [x'i, y\] t . Then 

X = PY = AX = A(PY), 


so 


Hence x\ 


Y = ( P~ l AP)Y 


1 

V 

o 

1 


X\ 

6 A 2 


. yi . 


AiXi and iji = A 2 yi • Then 

x'i = xi(0)e Al * and y\ = yi(0)e A2 *. 


But 


' ®(0) ' 

— p 

xi(0) 

. y(0) _ 

— i 

_ yi(o) _ 


so 


Consequently xi(0) 
x 

y 


xi(0) 

— p~ 1 

' x(0) ' 

_ yi(o) _ 


. 3/(0) _ 


= a and yi(0) = (3 and 



[X!\X 2 ] 


= ae Xlt X 1 + (3e X2t X 2 . 


a 

P 


ae Xlt 

Pe X2t 


x 0 

yo 


. We substitute 
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6. Let A = 


b 

d 


be a real matrix with non-real eigenvalues A = a + ib 


and A = a—ib, with corresponding eigenvectors X = U+iV and X = U—iV, 
where U and V are real vectors. Also let P be the real matrix defined by 
P = [£/|V]. Finally let a + ib = re lB , where r > 0 and 8 is real. 


(a) As X is an eigenvector corresponding to the eigenvalue A, we have AX = 
XX and hence 


A(U + iV) = ( a + ib)(U + iV ) 

AU + iAV = aU — bV + i(bU + aV). 

Equating real and imaginary parts then gives 

AU = aU-bV 
AV = bU + aV. 


(b) 


AP = A[U\V } = [AU\AV] = [aU-bV\bU+aV] = [U\V] 


a b 
—b a 


= P 


a b 
—b a 


Hence, as P can be shown to be non-singular, 


P~ 1 AP 


a b 
—b a 


(The fact that P is non-singular is easily proved by showing the columns of 
P are linearly independent: Assume xU + yV = 0, where x and y are real. 
Then we find 


(x + iy)(U — iV ) + (x — iy)(U + iV) = 0. 

Consequently x+iy = 0 as U—iV and U +iV are eigenvectors corresponding 
to distinct eigenvalues a — ib and a + ib and are hence linearly independent. 
Hence x = 0 and y = 0.) 

(c) The system of recurrence relations 

2-71+1 — ax n A by n 

lJ n +\ = cx n + dy n 
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has solution 




Pr n 


XQ 

yo _ 


7 ^ 

a b 

p-i 

—b a 


r cos 0 

rs'md 

— rsin0 

r cos d 

cos 0 

sin d 

— sin d 

cos d 


Xo 

yo 



r n [U\V] 


cos nd sin nd 
— sin nO cos nO 



n r | i a cos nd + (3 sin nd 
? — a sin ri0 + (3 cos nd 

r n {(a cos nd + [3 sin nd) U + (—a sin nO + (3 cos nO)V} 
r n { (cos nd) ( aU + (3V ) + (sin nd) (/3U — aV) } . 


(d) The system of differential equations 

dx 

— = ax + by 

dy , , 

— = cx + dy 


is attacked using the substitution X = PY, where Y = \x\, yi]*. Then 

Y = ( P~ 1 AP)Y ; 


so 

xi _ a b x\ 

3/1 J [ ~b a \ [ yi ' 

Equating components gives 

x\ = ax i + by i 
Vi = -bxi + ayi. 

Now let z = x i + iy\ . Then 

z = xi + iyi = (axi + byi) + i(-bxi + ayi) 

= (a — ib)(x i + iyi) = (a — ib)z. 
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Hence 


^ = z ( 0)e (a - i6)t 

x\ + iyi = (xi(0) + iy \(0))e at (cos bt — i sin bt). 

Equating real and imaginary parts gives 

x\ = e at {aq(0) cos bt + 2/1 (0) sin&t} 

Vi = e at {j/i(0) cos bt — £i(0) sinfet} . 


Now if we define a and j3 by 


a 

P 


= P~ l 


x(0) 

2 /( 0 ) 


we see that a = xi(0) and /3 = y i(0). Then 


x 

y 



[u\v\ 


e at (a cos bt + P sin bt) 
e at \P cos bt — a sin bt) 


e at {(a cos bt + /3sinbt)U + ((3 cos bt — asinftt)!/} 
e at {cos bt(aU + f3V) + sin bt.(/3U — aV)}. 


b 

d 


and suppose that 


7. (The case of repeated eigenvalues.) Let A = 

the characteristic polynomial of A, A 2 — (a + d)X + (ad — be), has a repeated 
root a. Also assume that A ^ a/ 2 . 


(i) 


A 2 — (a + d)\ + (ad — be) = (A — a) 2 

= A 2 — 2aA + a 2 . 


Hence a + d = 2a and ad — be = a 2 and 


(a + d) 2 
a~* -\~ 2 ad T d~ 
a 2 — 2 ad + d 2 + 46c 
(a — d) 2 + 46c 


4 (ad — be), 
4 ad — 46c, 
0 , 

0 . 
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(ii) Let B — A — al 2 ■ Then 
B 2 = (A- al 2 ) 2 


= A 2 - 2aA + a 2 h 
= A 2 — (a + d)A + (ad — bc)h , 


But by problem 3, chapter 2.4, A 2 — (a + d)A + (ad — bc)h = 0, so 
B 2 = 0. 


(iii) Now suppose that B ^ 0. Then BE\ ^ 0 or BE 2 7 ^ 0, as BEi is the 
i-th column of B. Hence BX 2 i=- 0, where X 2 = E\ or X 2 = E 2 . 


(iv) Let X\ = BX 2 and P = [Xil-X^]. We prove P is non-singular by 
demonstrating that X\ and X 2 are linearly independent. 

Assume xX\ + yX 2 = 0. Then 


xBX 2 + yX 2 = 0 
B(xBX 2 + yX2) = HO = 0 
xB 2 X 2 + yBX 2 = 0 
xOX 2 + yBX 2 = 0 
yBX 2 = 0. 


Hence y = 0 as BX 2 / 0. Hence xBX 2 = 0 and so x = 0. 
Finally, BX V = B(BX 2 ) = B 2 X 2 = 0, so (A - aI 2 )X 1 = 0 and 

AXi = aX\. (2) 


Also 

X\ = BX 2 = (A - aI 2 )X 2 = AX 2 - aX 2 . 

Hence 

AX 2 = Xi + aX 2 . (3) 

Then, using (2) and (3), we have 

AP = A[X i|A 2 ] = [AX^AXi] 

= [aX l \X l + aX 2 ] 
a 1 


= [Ai|A 2 ] 


0 a 


Hence 


and hence 


AP = P 


a 1 
0 a 
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P~ X AP = 


a 1 
0 a 


equation X = AX, where A = 


8. The system of differential equations is equivalent to the single matrix 

4 -1 
4 8 

_i 

The characteristic polynomial of A is A 2 — 12A + 36 = (A — 6) 2 , so we 
can use the previous question with a = 6. Let 


B = A — 61-2 = 


-2 -1 
4 2 


Then BX 2 = 


-2 
4 

P = [X 1 IX 2 ], we have 




if X 2 = 


P~ l AP = 


1 

0 

6 1 
0 6 


Also let X]_ = BX 2 . Then if 


Now make the change of variables X = PY , where Y = 


xi 

Vi 


Then 


Y = ( P~ l AP)Y = 


6 1 
0 6 


Y. 


or equivalently x\ = 6 x 1 + y\ and y’i = 6 yi . 

Solving for y\ gives y\ = yi(0)e 6 *. Consequently 

x'i = 6 x 1 + yi( 0 )e w . 

Multiplying both side of this equation by e _6i gives 

^(e“ 6 t xi) = e _ 6 t x'i - 6 e _w xi = yi( 0 ) 

e~ 6t xi = yi( 0 )t + c, 

where c is a constant. Substituting t = 0 gives c = xi(0). Hence 

e~ 6t xi = yi(0)t + xi( 0 ) 


and hence 

xi = e 6< (yi(0)t + xi(0)). 
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However, since we are assuming x'(0) = 1 = y( 0), we have 


xi(0) 
2/1 ( 0 ) 


0 -1 ' 

' 1 ' 

1 

' -1 ' 


' 1/4 ' 

-4 -2 

1 

“ ^4 

-6 


3/2 


= P 


-l 


x(0) 

2 /( 0 ) 


Hence x\ = e 6t (| t + |) and yi = |e 6t . 
Finally, solving for x and y, 


X 


' -2 

1 ' 


Xl 

_ y _ 


4 

0 


. 2/1 . 


-2 1 
4 0 


e 6 t (i*+D 

3 61 
2 e 

(-2)e M (|f+|) + |e 6t 

4e w (lt+|) 


e 6t (l - 3f) 
e w (6t + 1) 


Hence x = e 6t (l — 3t) and y = e m (6t + 1). 


9. Let 


A = 


1/2 1/2 0 
1/4 1/4 1/2 
1/4 1/4 1/2 


(a) We first determine the characteristic polynomial ch^A). 


clu(A) 


det (A/3 — A) 


A - 1/2 - 1/2 0 

1/4 A -1/4 -1/2 

-1/4 -1/4 A -1/2 
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(b) Hence the characteristic polynomial has no repeated roots and we can 
use Theorem 6.2.2 to find a non-singular matrix P such that 

P~ l AP = diag(l, 0, |). 

We take P = [X1JX2IX3], where X\, X2, X3 are eigenvectors corresponding 
to the respective eigenvalues 1,0, 

Finding X \ : We have to solve (A — I$)X = 0. we have 

' -1/2 1/2 0 1 [ 1 0 -1 ' 

A-h= 1/4 -3/4 1/2 -> 0 1-1 . 

1/4 1/4 -1/2 J [ 0 0 0 _ 

Hence the eigenspace consists of vectors X = [x, y, z] 1 satisfying x = z and 

y = z, with z arbitrary. Hence 

' z 1 r 1 ' 

X = 2 =2 1 

_Z \ [ 1 _ 

and we can take X\ = [1, 1, l]h 
Finding X 2: We solve AX = 0. We have 

'1/2 1/2 0 1 [ 1 10" 

A = 1/4 1/4 1/2 0 0 1 . 

_ 1/4 1/4 1/2 J [ 0 0 0 _ 

Hence the eigenspace consists of vectors X = [x, y, z] f satisfying x = —y 
and z = 0, with y arbitrary. Hence 



and we can take X2 = [—1, 1, 0]*. 

Finding X3 : We solve (A — \l$)X = 0. We have 

T 1/4 1/2 0 1 r 1 0 2 

A--h= 1/4 0 1/2 — > 0 1—1 

4 |_ 1/4 1/4 1/4 J [ 0 0 0 
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Hence the eigenspace consists of vectors X = [x, y, z]* satisfying x = —2 z 
and y = z, with z arbitrary. Hence 

' - 2 z i r- 2 - 

X = z = z 1 

0 J L 0 . 

and we can take X% = [—2, 1, 1]*. 

[ 1 -1 -2 ' 

Hence we can take P = 1 1 1 

1 0 1 _ 

(c) A = Pdiag(l, 0, \)P~ l so A n = Pdiag(l, 0, ^)P~ l . 

Hence 


A n 



1 

3 


' 1 

1 

1 ' 

1 

+ 3 • 4 n 

to 

2 

1 

1 

1 

1 

1 

-1 

-1 

2 

1 

1 

1 


_ -1 

-1 

2 _ 



10. Let 



(a) We first determine the characteristic polynomial ch^A). 



A- 5 -2 2 

R 3 —> R 3 + 7 ? 2 

A - 5 -2 2 

clu(A) = 

-2 A - 5 2 

-2 A - 5 2 


2 2 A — 5 


0 A — 3 A — 3 


A - 5 -2 2 
= (A - 3) -2 A - 5 2 

0 1 1 
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C :i -> C:i - C 2 


(A -3) 


A - 5 -2 4 

-2 A- 5 -A + 7 
0 1 0 


— —(A — 3) {(A — 5)(— A + 7) + 8} 

= —(A — 3)(— A 2 + 5A + 7A — 35 + 8) 
= —(A — 3)(— A 2 + 12A — 27) 

- —(A — 3)(— 1)(A — 3)(A — 9) 

= (A — 3) 2 (A — 9). 


We have to find bases for each of the eigenspaces N(A — 9 / 3 ) and N(A — 3Is). 
First we solve [A — 3 / 3 )^ = 0. We have 


to 

2 

CM 

1 


' l 

1 

T— 1 

1 

2 

2 

CM 

1 

-> 

0 

0 

0 

1 

1 

to 

CM 

1 

to 

1 


1 

0 

0 

1 

0 


Hence the eigenspace consists of vectors X = \x, y, z]* satisfying x = — y + z, 
with y and z arbitrary. Hence 


X = 

" -y + z' 

y 

= y 

' -1 ' 
1 

+ z 

' 1 ' 
0 


z 


0 


1 


so X\ = [—1, 1, 0]* and X 2 = [1, 0, 1]* form a basis for the eigenspace 
corresponding to the eigenvalue 3. 

Next we solve (A — 9I^)X = 0. We have 


A — 9/3 


1 

1 

2 

1 

CM 

! 


' 1 

0 

1 ' 

2 

-4 

CN 

1 

-> 

0 

1 

1 

1 

1 

to 

CM 

1 

1 

1 


1 

0 

0 

1 

0 


Hence the eigenspace consists of vectors X = [x, y, z]* satisfying x = —z 
and y = —z, with z arbitrary. Hence 



—z 


' -1 ' 

X = 

—z 

= z 

-1 


z 


1 


and we can take X 3 = [— 1 , — 1 , 1 ] 1 as a basis for the eigenspace correspond- 
ing to the eigenvalue 9. 
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Then Theorem 6.2.3 assures us that P = [X 1 JX 2 IX 3 ] is non-singular and 


P-'AP = 


3 0 0 
0 3 0 
0 0 9 
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Figure 1: (a): x 2 — 8x + 8y + 8 = 0; (b): y 2 — 12.x + 2y + 25 = 0 

Section 7.3 

1. (i) x 1 2 — 8x+8y+8 = (x— 4) 2 +8(y— 1). So the equation x 2 — 8x+8y+8 = 0 
becomes 

x\ + 8yi = 0 (1) 

if we make a translation of axes x — 4 = x±, y — 1 = y\. 

However equation (1) can be written as a standard form 

1 2 

Vi = 

which represents a parabola with vertex at (4, 1). (See Figure 1(a).) 

(ii) y 2 — 12x + 2y + 25 = (y + l) 2 — 12(x — 2). Hence y 2 — 12x + 2y + 25 = 0 
becomes 

y\ - 12xi = 0 (2) 

if we make a translation of axes x — 2 = xi, y + 1 = y\. 

However equation (2) can be written as a standard form 

y\ = 12xi, 

which represents a parabola with vertex at (2, —1). (See Figure 1(b).) 

2. 4 xy — 3 y 2 = X t AX 1 where A = ^ \ and X = 1 The 

2 -3 J [ y 

eigenvalues of A are the roots of A 2 + 3A — 4 = 0, namely Ai = —4 and 
A 2 = 1. 
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The eigenvectors corresponding to an eigenvalue A are the non-zero vec- 
tors \x, y ]* satisfying 


Ai 


0 - A 
2 



x 


' 0 ' 


. y . 


0 


—4 gives equations 


4x + 2y = 0 
2 x + y = 0 


which has the solution y = —2x. Hence 


X 


X 


1 ' 

. y . 


—2x 

— X 

-2 


A corresponding unit eigenvector is [l/\/5, — 2/v / 5] t . 
A 2 = 1 gives equations 


—x + 2 y = 0 

2x — 4 y = 0 

which has the solution x = 2 y. Hence 


X 


' 2 y ' 


' 2 ' 

. v _ 


y _ 

— y 

1 


A corresponding unit eigenvector is [2/\/5, l/v / 5] t - 
Hence if 


1 2 

-2 1 > 
V5 


then P is an orthogonal matrix. Also as det P = 1, P is a proper orthogonal 
matrix and the equation 


x 

y 



represents a rotation to new xq , yi axes whose positive directions are given 
by the respective columns of P. Also 


P f AP 


-4 0 
0 1 
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Then X t AX = —4xf + y 2 and the original equation 4 xy — 3 y 2 = 8 becomes 
—4xf + y 2 = 8, or the standard form 






which represents an hyperbola. 

The asymptotes assist in drawing the curve. They are given by the 
equations 


^2 + ^=0, or V1= ±2 X1 . 


Now 


x\ 

y\ 


= p l 


X 

y 


1 —2 

— VE vT 
~ 2 1 

VE VE 


x 

y 


so 


Xl = 

Hence the asymptotes are 


x — 2y 

7T’ 


y i = 


2x + y 

7T' 


2x + y 

~~w 


= ± 2 


x — 2y 


which reduces to y = 0 and y = 4x/3. (See Figure 2(a).) 


3. 8x 2 — 4xy + 5 y 2 = X l AX, where A = 


8 -2 
-2 5 


and X = 


x 

y 


. The 


eigenvalues of A are the roots of A 2 — 13A + 36 = 0, namely Ai = 4 and 
A 2 = 9. Corresponding unit eigenvectors turn out to be [l/\/5, 2/\/5 ]* and 
[-2/>/5, 1/V5]*- Hence if 

1 -2 

p \/5 \/5 

7 — 2 1 ) 

vT 

then P is an orthogonal matrix. Also as det P = 1, P is a proper orthogonal 
matrix and the equation 


x 

y 



represents a rotation to new x \ , yi axes whose positive directions are given 
by the respective columns of P. Also 


P l AP 


4 0 
0 9 
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Figure 2: (a): 4 xy — 3 y 2 = 8; (b): 8x 2 — 4 xy + 5 y 2 = 36 

Then X t AX = 4x\ + 9 y\ and the original equation 8x 2 — 4 xy + 5 y 2 = 36 
becomes 4x 2 + 9 y 2 = 36, or the standard form 

A , Vi = t 

9 + 4 

which represents an ellipse as in Figure 2(b). 

The axes of symmetry turn out to be y = 2x and x = —2 y. 

4. We give the sketch only for parts (i), (iii) and (iv). We give the working 
for (ii) only. See Figures 3(a) and 4(a) and 4(b), respectively. 

(ii) We have to investigate the equation 

5x 2 — 4 xy + 8 y 2 + 4\/^>x — 16 >/5 y + 4 = 0. (3) 

T 5 —2 1 

Here 5x 2 — 4 xy + 8 y 2 = X t AX 1 where A = and X = 

— 2 8 

The eigenvalues of A are the roots of A 2 — 13A + 36 = 0, namely Ai = 9 and 
A 2 = 4. Corresponding unit eigenvectors turn out to be [l/\/5, —2j\f^\ and 
[2/v/5, Hence if 

1 2 

0 _ y/E a/5 

1 ~ - 2 1 > 

+5 +5 

then P is an orthogonal matrix. Also as det P = 1, P is a proper orthogonal 
matrix and the equation 

x = p x 1 

V \ [ Vi 
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represents a rotation to new x \ , y\ axes whose positive directions are given 
by the respective columns of P. Also 


P l AP 


9 0 
0 4 


Moreover 

5x 2 — 4 xy + 8 y 2 = 9xf + 4 y\. 

To get the coefficients of x\ and y± in the transformed form of equation (3), 
we have to use the rotation equations 


x = ^=(x 1 +2yi), y = -^=(-2x 1 + yi). 


Then equation (3) transforms to 


9xl + %i + - 8yi + 4 = 0, 


or, on completing the square, 

9(xi + 2) 2 + 4(yi — l) 2 = 36, 


or in standard form 

A , vl = 

4 + 9 

where X 2 = x\ + 2 and y 2 = yi — 1. Thus we have an ellipse, centre 
(x 2 , 2 / 2 ) = (0, 0), or Oi, yi) = (-2, 1), or (x, y) = (0, \/5). 

The axes of symmetry are given by X 2 = 0 and y 2 = 0, or x\ + 2 = 0 
and yi — 1 = 0, or 


1 

71 


(x 


2y) + 2 


0 and 


-7(2x + y) -1 = 0, 


which reduce to x — 2y + 2y / 5 = 0 and 2x + y — \/5 = 0. See Figure 3(b). 
5. (i) Consider the equation 

2x 2 + y 2 + 3xy — 5x — 4y + 3 = 0. (4) 


A = 

2 3/2 -5/2 
3/2 1 -2 

= 8 

4 3-5 

3 2-4 

= 8 

1 1 -1 

3 2-4 


-5/2 -2 3 


-5 -4 6 


-2 -2 2 
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Let x = x\ + a, y = yi + (3 and substitute in equation (4) to get 

2(xi + a) 2 + (yi + P) 2 + 3(xi + a)(yi+P)-5(xi + a)-4(yi + /3) + 3 = 0 (5). 

Then equating the coefficients of x\ and y\ to 0 gives 

4a + 3(3 - 5 = 0 
3a + 2/3 — 4 = 0, 

which has the unique solution a = 2, (3 = — 1. Then equation (5) simplifies 
to 

2xf + y\ + 3xiyi = 0 = (2aq + yi)(xi + yi). 

So relative to the xi, y\ coordinates, equation (4) describes two lines: 2xi + 
2 /i = 0 and xi+yi = 0. In terms of the original x , y coordinates, these lines 
become 2(x — 2) + (y + 1) = 0 and (x — 2) + (y + 1) = 0, i.e. 2x + y — 3 = 0 
and x + y — 1 = 0, which intersect in the point 

(x, y) = (a, (3) = (2, -1). 

(ii) Consider the equation 

9x 2 + y 2 — 6 xy + 6x — 2y + 1 = 0. (6) 

Here 

9-3 3 

A = 3 1-1=0, 

3 -1 1 

as column 3 = — column 2. 

Let x = xi + a, y = y\ + /3 and substitute in equation (6) to get 

9(xi + a) 2 + (yi + (3) 2 - 6(xi + a){y\ + (3) + 6(xi + a) - 2(yi + f3) + 1 = 0. 

Then equating the coefficients of xi and y\ to 0 gives 

18a — 6/3 + 6 = 0 
—6a + 2/3 — 2 = 0, 

or equivalently —3a + (3 — 1 = 0. Take a = 0 and (3 = 1. Then equation (6) 
simplifies to 

9x 2 + y 2 - 6xii/i = 0 = (3xi - yi) 2 . (7) 
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In terms of x, y coordinates, equation (7) becomes 

( 3x — (y — l)) 2 = 0, or 3x — y + 1 = 0. 

(iii) Consider the equation 

x 2 + 4 xy + 4 y 2 — x — 2y — 2 = 0. (8) 

Arguing as in the previous examples, we find that any translation 

x = xi ± a, y = yi + (5 

where 2a + 4/3 — 1 = 0 has the property that the coefficients of x\ and y\ 
will be zero in the transformed version of equation (8). Take (3 = 0 and 
a = 1/2. Then (8) reduces to 

9 9 9 

aq + 4aqyi + 4y x - - = 0, 

or (x'i +2yi) 2 = 3/2. Hence x\ + 2y\ = ±3/2, with corresponding equations 
x ± 2y = 2 and x + 2y = — 1. 
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Section 8.8 


1. The given line has equations 

x = 3 ± f(13 — 3) = 3 ± lOf, 
y = — 2 ± t{ 3 + 2) = —2 + 5 t, 
z = 7 + t(— 8 — 7) = 7 — 15t. 

The line meets the plane y = 0 in the point (x, 0, z), where 0 = —2 ± 5 1, or 
t = 2/5. The corresponding values for x and z are 7 and 1, respectively. 

2. E = ^(B + C), F = (1 — t) A ± tE, where 

^_ AF _ AF _ AF/FE _ 2 
' ~ ~AE ~ AF + FE ~ {AF/FE) + 1 “ 3' 

Hence 

f - s a+ ^G ( b+c) ) 

- S A 4 (B+C) 

= i(A + B + C). 

3. Let A = (2, 1, 4), B = (1, —1, 2), C = (3, 3, 6). Then we prove AC= 
t AB for some real t. We have 



' 1 ' 


' -1 ' 

AC= 

2 

, AB= 

-2 


2 


-2 


Hence AC= (—1) AB and consequently C is on the line AB. In fact A is 
between C and B, with AC = AB. 

4. The points P on the line AB which satisfy AP = | PB are given by 

P = A + t AB, where |f/(l — t)\ = 2/5. Hence t/{ 1 — t) = ±2/5. 

The equation t/{ 1 — t) = 2/5 gives t = 2/7 and hence 


2 ' 

2 

' 1 ' 


' 16/7 ' 

3 

+ 7 

4 

= 

29/7 

-1 

5 


3/7 
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Hence P = (16/7, 29/7, 3/7). 

The equation t/( 1 — t) = —2/5 gives t = —2/3 and hence 


P = 


2 

3 

-1 



4/3 ' 
1/3 

-13/3 _ 


Hence P = (4/3, 1/3, —13/3). 

5. An equation for A4 is P = A + 1 BC, which reduces to 


x = 1 + 6t 

y = 2 — 3 1 

z = 3 + 7 1. 


An equation for AAis Q = E + s EF, which reduces to 

x = 1 + 9-s 

y = -i 

2 i = 8 + 3-s. 


To find if and where A4 and M intersect, we set P = Q and attempt to solve 
for s and t. We find the unique solution t = 1, s = 2/3, proving that the 
lines meet in the point 

(s, y, Z) = (1 + 6, 2 - 3, 3 + 7) = (7, -1, 10). 


6. Let A = (3, 5, 6), B = (-2, 7, 9), C = (2, 1, 7). Then 

(i) 


cos ZABC = {BA ■ BC)/{BA ■ BC), 


where BA= [—1, —2, —3]* and BC= [4, —6, — 2] 4 . Hence 


cos ZABC 


-4 + 12 + 6 _ 14 _ 1 

\/l4\/56 ~ \/l4\/56 ~ 2 


Hence ZABC = n/3 radians or 60°. 



(ii) 

cos ABAC = (AB • AC) /( AB ■ AC ) , 

where AB= [1, 2, 3]* and AC= [5, —4, 1]*. Hence 

cos ABAC = 5 I . 8 = 0. 

\/l4\/42 

Hence ZABC = tt/2 radians or 90°. 

(hi) 


cos ZACB = ( CA ■ CB)/(CA ■ CB), 

where CA= [—5, 4, — l] 4 and CB= [—4, 6, 2]*. Hence 

20 + 24 - 2 42 a/42 ^3 

\/42\/56 a/42a/56 \/56 2 

Hence ZACB = n/Q radians or 30°. 


7. By Theorem 8.5.2, the closest point P on the line AB to the origin O is 
given by P = A + t AB, where 


~AO AB _ -A- AB 
AB 2 “ AB 2 


Now 


A- ~AB= 



= - 2 . 


Hence t = 2/11 and 



' -2 ' 

2 

' 3 ' 


' -16/11 ' 

p = 

1 

+ IT 

1 

= 

13/11 


3 

1 


35/11 


and P = (-16/11, 13/11,35/11). 

Consequently the shortest distance OP is given by 


/ — 16\ 2 / 13\ 2 /35\ 2 _ a/1650 _ V15 x 11 x 10 _ \/l50 

\~rT ) + VTTy + “ n “ ii ~ W 
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Alternatively , we can calculate the distance OP 2 , where P is an arbitrary 
point on the line AB and then minimize OP 2 : 


Hence 


P = A + t AB= 


' -2 ' 


' 3 ' 


" -2 + St' 

1 

+ t 

1 

= 

1 At 

3 


1 


3 + 1 


OP 2 


Consequently 


for all t: moreover 


when t = 2/11. 


(—2 + 3f) 2 + (1 + t) 2 + (3 + t) 2 
lit 2 - At + 14 



OP 2 > 11 x 


150 

121 


OP 2 = 11 X — 
121 


8. We first find parametric equations for N by solving the equations 


x + y — 2z = 1 

x + 3y — z = 4. 


The augmented matrix is 

'1 1 -2 1 ' 

1 3 -1 4 J ’ 

which reduces to 

' 1 0 -5/2 -1/2 ' 

0 1 1/2 3/2 ' 

Hence x = — | + | z, y = | — with z arbitrary. Taking z = 0 gives a point 
A = (— j, 0), while z = 1 gives a point B = (2, 1, 1). 
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Hence if C = (1, 0, 1), then the closest point on M to C is given by 

P = A + t AB, where t = (AC ■ AB)/AB 2 . 

Now 


3/2 


5/2 

-3/2 

and AB= 

-1/2 

1 


1 


t = 


3 

2 


x| + Jx J + lxl 

(§) 2 +(^) 2 +l 2 


11 

15' 


Hence 



" -1/2 ' 

11 

5/2 


4/3 

P = 

3/2 

+ 15 

-1/2 

= 

17/15 


0 _ 

1 


11/15 


so P = (4/3, 17/15, 11/15). 

Also the shortest distance PC is given by 


PC 



v/330 

15 


9. The intersection of the planes x + y — 2z = 4 and 3x — 2 y -\- z = 1 is the 
line given by the equations 

9 3 11 7 

x = ~ + -z, y = — + -z, 

5 5 5 5 

where z is arbitrary. Hence the line C has a direction vector [3/5, 7/5, 1]* 
or the simpler [3, 7, 5]*. Then any plane of the form 3x + 7y + 5z = d will 
be perpendicualr to C. The required plane has to pass through the point 
(6, 0, 2), so this determines d: 

3x6 + 7x0 + 5x2 = d = 28. 


10. The length of the projection of the segment AB onto the line CD is 
given by the formula 

| CD ■ AB | 

CD ' 

Here CD= [—8, 4, —1]* and AB= [4, —4, 3]*, so 

\CD-AB\ _ |(— 8) x 4 + 4 x (-4) + (-1) x 3| 

CD “ v /(_8) 2 + 4 2 +FI)2 

I - 51 l _ 5i _ 17 
v/8l 9 “ 3 ' 
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11. A direction vector for C is given by BC= [—5, —2, 3]*. Hence the plane 
through A perpendicular to £ is given by 

—5x -2y + 3z = (-5) x 3 + (-2) x (-1) + 3 x 2 = -7. 


The position vector P of an arbitrary point P on C is given by P = B+f BC, 
or 


r*i 


' 2 ' 


r -5 1 

i 

i 

— 

1 

4 

+ t 

1 

CM CO 

1 

l 


or equivalently x = 2 — 5t, y = 1 — 2t, z = 4 + 3t. 

To find the intersection of line C and the given plane, we substitute the 
expressions for x, y, z found in terms of t into the plane equation and solve 
the resulting linear equation for t: 


-5(2 - 5i) - 2(1 - 2 1) + 3(4 + 3 t) = -7, 


which gives t = —7/38. Hence P = §§, and 


AP 



y/U 1U v/293 x 38 v/293 


38 38 v/38 ' 


12. Let P be a point inside the triangle ABC. Then the line through P and 
parallel to AC will meet the segments AB and BC in D and E. respectively. 
Then 


P 

= (1 — r)D + rE, 

0 < r < 1 

D 

= (1 — s)B + -sA, 

0 < s < 1 

E 

= (1 — t)B + tC, 

0 < t < 1. 


Hence 

P = (l-r){(l-s)B + sA} + r{(l-f)B + fC} 

= (1 — r)sA + {(1 — r)(l — s) + r( 1 — t)} B + rtC 

= aA + /3B+ C, 
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where 


a = (1 — r)s, (3 = (1 — r)(l — s) + r( 1 — t), = rt. 

Then 0 < a < 1, 0< <1, 0 < (3 < (1 — r) + r = 1. Also 

a + j3 + = (1 — r)s + (1 — r)(l — <s) + r(l — t) + rt = 1. 

13. The line AS is given by P = A + i[3, 4, 5]*, or 

x = 6 + 3t, y=— l + 4t, z = ll + 5i. 

Then S is found by substituting these expressions in the plane equation 

3x + Ay + 5z = 10. 

We find t = —59/50 and consequently 


/ 177 236 295 \ _ / 123 -286 255\ 

“v _ "50"’ _ ~ "50"’ _ "50"J “ V^o"’ “50 - ’ "50"y ■ 


Then 


AB 



1*1^32 + 4 2 + 5 2 


59 f— 

50 X%/ “ 


59 

a/50 


14. Let A = (—3, 0, 2), B = (6, 1, 4), C = (—5, 1, 0). Then the area of 
triangle ABC is g|| AB x AC ||. Now 


AB x AC= 


' 9 ' 


' -2 ' 


' -4 ' 

1 

X 

1 

= 

14 

2 


-2 


11 _ 


Hence || AB x AC || = \/333. 

15. Let A\ = (2, 1, 4), A -2 = (1, —1, 2), A3 = (4, —1, 1). Then the point 
P = (x, y, z ) lies on the plane A1A2A3 if and only if 

Ai P -(AiA 2 x A1A3) = 0, 
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or 

x — 2 y — 1 z — A 

-1 -2 -2 = 2x — 7y + 6z — 21 = 0. 

2 -2 -3 

16. Non-parallel lines C and A4 in three dimensional space are given by 
equations 

P = A + sX, Q = B + tY. 

(i) Suppose PQ is orthogonal to both X and Y. Now 

PQ= Q - P = (B + tY) - (A + sX) =AB +tY - sX. 

Hence 

(AB +tY + sX)-X = 0 
(AB +tY + sX)-Y = 0. 

More explicitly 


t(Y -X)-s(X-X) = - AB X 
t(Y ■ Y) - s{X ■ Y) = -AB Y. 

However the coefficient determinant of this system of linear equations 
in t and s is equal to 

= -(X -Y) 2 + (X ■ X)(Y -Y) 

= \\XxY\\ 2 ^0, 

as X / 0, Y 0 and X and Y are not proportional (X and Xi are 
not parallel). 

(ii) P and Q can be viewed as the projections of C and D onto the line PQ, 
where C and D are arbitrary points on the lines C and A4 , respectively. 
Hence by equation (8.14) of Theorem 8.5.3, we have 

PQ < CD. 

Finally we derive a useful formula for PQ. Again by Theorem 8.5.3 


Y X -X ■ X 

Y Y —X • Y 


PQ 


AB • PQ 
PQ 


AB -h 
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X 


where n = — PQ is a unit vector which is orthogonal to X and Y. 
Hence 


h = t(X x H), 

where t = ±1/||X x Y"||. Hence 


PQ 


AB \X x y)| 

11^ X y|| 


17. We use the formula of the previous question. 
Line C has the equation P = A + sX, where 


X =AC= 


2 

-3 

3 


Line A4 has the equation Q = B + tY, where 


y =BD= 


1 

1 

1 


Hence X x Y = [-6, 1, 5]* and \\X xY\\ 


a/62- 
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Hence the shortest distance between lines AC and BD is equal to 


AB -(X x Y)| 
WXx Y\\ 



0 


1 

Oi 



CM 

1 


l 



1 . 


Oi 



v/62 


a/62 


18 . Let E be the foot of the perpendicular from A 4 to the plane A1A2A3. 
Then 

vol A1A2A3A4. = -(area AA1A2A3) ■ A4E. 


Now 


areaAHi^Ha = -|| A1A2 x A1A3 


Also A4E is the length of the projection of A1A4 onto the line A4E. See 
figure below.) 

Hence A4E = \ A1A4 -X\, where A is a unit direction vector for the line 
A4E. We can take 

A1A2 x A1A3 


X = 


A1A2 x A1A3 


Hence 


1 7 — ► - — ► I A1A4 \A\A2 x 

V0IA1A2A3A4 = — A\A2 x A1A3 — 

6 “ “ 


A1A2 X A1A3 


~ r I ^ 1^4 -(A\A2 X A1A3) 
6 
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19. We have CB= [1, 4, 


\\{MA 2 


x A\A%)- AiA± 


— 1]*, CD= [—3, 3, 0]*, AD= [3, 0, 3]*. Hence 
CB x CD= 3i + 3j + 15k, 

so the vector i + j + 5k is perpendicular to the plane BCD. 

Now the plane BCD has equation x + y + 5z = 9, as B = (2, 2, 1) is on 
the plane. 

Also the line through A normal to plane BCD has equation 


X 


■ 1 " 


' 1 " 


' 1 ' 

y 

z 

— 

1 

5 

+ t 

1 

5 

— (! + *) 

1 

5 


Hence x = 1 + t, y = 1 + t, z = 5(1 + t). 

[We remark that this line meets plane BCD in a point E which is given 
by a value of t found by solving 

(1 + t) + (1 + t) + 5(5 + 5 1) = 9. 

So t = -2/3 and E = (1/3, 1/3, 5/3).] 

The distance from A to plane BCD is 

|lxl + lxl + 5x5-9|_ 18 _ oA5 - 
l 2 + l 2 + 5 2 ~ V-T 2X/3 ' 

To find the distance between lines AD and BC, we first note that 

(a) The equation of AD is 


' 1 ' 


' 3 ' 


' 1 + 3f ' 

1 

+ t 

0 

= 

1 

5 


3 


5 + 3f 


(b) The equation of BC is 


' 2 " 


1 " 


2 + s 

2 

+ s 

4 

= 

2 + 4.s 

1 


-1 


1 — s 
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D 


Then PQ= [1 + s — 3f, 1 + 4s, —4 — s — 3i] 4 and we find s and t by solving 
the equations PQ ■ AD= 0 and PQ ■ BC= 0, or 

(1 + s — 3t)3 + (1 + 4s)0 + (—4 — s — 3t)3 = 0 
(1 + s — 3£) + 4(1 + 4s) — (—4 — s — 3i) = 0. 

Hence t = — 1/2 = s. 

Correspondingly, P = (—1/2, 1, 7/2) and Q = (3/2, 0, 3/2). 

Thus we have found the closest points P and Q on the respective lines 
AD and BC. Finally the shortest distance between the lines is 

PQ = || PQ || =3. 


98 



